Non-human animals that select for light chain variable regions that bind antigen

ABSTRACT

Non-human animals, cells, methods and compositions for making and using the same are provided, wherein the non-human animals and cells comprise an immunoglobulin heavy chain locus that includes unrearranged human immunoglobulin light chain gene segments and an immunoglobulin light chain locus that includes a single rearranged human light chain variable region nucleotide sequence. The unrearranged human light chain gene segments may be operably linked to a heavy chain constant region nucleotide sequence and the rearranged human immunoglobulin light chain variable region nucleotide sequence may be operably linked to a light chain constant region nucleotide sequence. Also provided are methods for obtaining nucleic acid sequences that encode immunoglobulin light chain variable domains capable of binding an antigen in the absence of a cognate variable domain, and expressing such nucleic acid sequences in a host cell, e.g., to generate a multispecific antigen-binding protein.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims the benefit under 35 U.S.C. § 119(e) of U.S. Provisional Application Ser. No. 62/135,419, filed 19 Mar. 2015, which application is hereby incorporated by reference.

SEQUENCE LISTING

An official copy of the sequence listing is submitted concurrently with the specification electronically via EFS-Web as an ASCII formatted sequence listing with a file name of 2016-03-18-1150WO01-SEQ-LIST ST25.txt, a creation date of Mar. 18, 2016, and a size of about 8.62 kilobytes. The sequence listing contained in this ASCII formatted document is part of the specification and is herein incorporated by reference in its entirety.

FIELD OF INVENTION

Provided herein are immunoglobulin light chain variable (V_(L/CH×ULC)) domains that are derived from a immunoglobulin hybrid chain that is cognate to a universal light chain, and that may bind antigen independently (e.g., in absence of) a cognate variable domain of the universal light chain, genetically modified non-human animals and cells that express V_(L/CH×ULC) domains, nucleic acids that encode V_(L/CH×ULC) domains, antigen-binding proteins (e.g., multispecific antigen-binding proteins) comprising one or more V_(L/CH×ULC) domains, and in vitro methods of generating antigen-binding protein (e.g., multispecific antigen-binding proteins) comprising one or more V_(L/CH×ULC) domains.

BACKGROUND

A number of promising novel diagnostics and therapies are biologics, commonly based on a traditional antibody format. However, traditional antibody-based design may be limited as antigen binding typically requires an antibody molecule that includes four polypeptides: two identical immunoglobulin heavy chains and two identical immunoglobulin light chains. The present invention encompasses the recognition that there remains a need for improvement and diversification of immunoglobulin-based therapeutic design.

SUMMARY

The present invention provides improved technologies for the development, production, and/or use of antigen-binding proteins based on immunoglobulin format. The present invention encompasses the recognition that conventional antibody-based format imposes certain constraints on the technology. For example, the present invention recognizes that requiring an antigen-binding site to be comprised of heavy and light chain variable domains can restrict the available affinity and/or specificity that can be achieved with respect to some antigenic determinants.

The present invention provides technologies that solve these problems. Among other things, the present invention provides genetically engineered non-human animals that express a “universal” or “common” immunoglobulin light chain variable domain and are useful, for example, in the development and/or production of novel antigen-binding protein formats. Moreover, the present invention surprisingly demonstrates that use of such an animal expressing a universal immunoglobulin light chain can direct selection of partner immunoglobulin chains whose variable domain binding characteristics can dominate within an antigen-binding site, even when the partner (or cognate) immunoglobulin chain's variable domain is a light chain variable domain. Thus, contrary to expectations in the art, the present invention demonstrates that it is possible to develop immunoglobulin light chain variable regions that determine or control specificity and/or affinity of antigen-binding sites in which they participate, e.g., that bind antigen when associated with a universal light chain variable domain and/or in the absence of, i.e., independently of, a cognate universal light chain variable domain.

Thus, in some embodiments, the present invention provides antigen-binding proteins, including multispecific antigen-binding proteins, comprising one or more immunoglobulin light chain variable domains that bind antigen when associated with a universal light chain variable domain and/or independently of a cognate universal light chain variable domain. Also provided are technologies, e.g., non-human animals and in vitro recombinant methods, for the development, production, and or use of such immunoglobulin light chain variable domain sequences in which antigen specificity and affinity results solely or primarily from, and/or resides solely or primarily in, immunoglobulin light chain variable domain diversity.

Various aspects and embodiments described herein are based in part on the surprising discovery that genetically modified non-human animals that express binding proteins that contain immunoglobulin light chain variable domains that are operably linked to a heavy chain constant region and immunoglobulin light chain variable domains encoded by a rearranged light chain variable gene sequence (e.g., a rearranged light chain V_(L)J_(L) sequence) can solve various problems recognized herein and/or can provide surprising results. Non-human animals whose genome includes (i) a hybrid immunoglobulin chain locus containing unrearranged human light chain gene segments (e.g., V_(L) and J_(L) gene segments) operably linked to a heavy chain constant region sequence, e.g., at an endogenous heavy chain locus; and (ii) an immunoglobulin light chain locus containing a rearranged immunoglobulin light chain variable sequence (e.g., a single rearranged immunoglobulin light chain variable region sequence, such as for example a universal light chain variable region sequence) operably linked to a light chain constant gene can focus the mechanisms of antibody diversification on the unrearranged (i.e., diversifiable) immunoglobulin light chain variable gene segment(s) operably linked to the heavy chain constant region. Upon rearrangement, the unrearranged human light chain gene segments form a light chain variable region gene sequence that is operably linked to a heavy chain constant region gene sequence to form a sequence that encodes a immunoglobulin hybrid chain, i.e., an immunoglobulin polypeptide comprising a light chain variable domain fused with a heavy chain constant region. Non-human animals with the genomes described herein are able to generate antigen-binding proteins comprising dimeric immunoglobulin hybrid chains, each associated with cognate universal light chains in typical tetrameric antibody format, wherein the immunoglobulin hybrid chains comprise a light chain variable domain that is cognate with the light chain variable domain of the universal light chain, e.g., a V_(L/CH×ULC) variable domain.

As shown herein, a light chain variable V_(L/CH×ULC) domain that is derived from an immunoglobulin hybrid chain (e.g., is encoded by a V_(L)/J_(L) gene sequence that encodes a variable domain of an immunoglobulin hybrid chain) and that is cognate to a universal light chain variable domain is capable of binding an antigen of interest in the presence or absence of the cognate universal light chain variable domain. The immunoglobulin hybrid chain from which the V_(L/CH×ULC) domain is derived is preferably somatically hypermutated and is not a single domain antibody, e.g., preferably has a heavy chain constant region that has an isotype selected from the group consisting of IgD, IgG, IgE and IgA and comprises a functional C_(H)1 domain. Such a variable V_(L/CH×ULC) domain is also able to bind antigen when associated with a second and noncognate variable domain specific for a different epitope, and regardless of whether the variable V_(L/CH×ULC) domain is operably fused to a a heavy chain constant region or a light chain constant region.

Accordingly, provided herein are antigen-binding proteins comprising at least a first binding component comprising an immunoglobulin light chain variable domain, e.g., a light chain variable V_(L/CH×ULC) domain, wherein the V_(L/CH×ULC) domain is (1) derived from a immunoglobulin hybrid chain encoded by a light chain sequence operably linked to one or more heavy chain constant region genes, e.g., Igμ, Igδ, Igγ, Igα and/or Igε, each of which comprises a nucleotide sequence that encodes a functional C_(H)1 domain, and (2) cognate to a universal light chain encoded by a rearranged light chain sequence operably linked to a light chain constant region gene.

In some embodiments, the a light chain variable V_(L/CH×ULC) domain is a V_(κOH×ULC) domain, e.g., is derived from and/or encoded by, a κ light chain variable region nucleotide sequence, e.g., a human κ light chain variable region nucleotide sequence, e.g., a Vκ1-5, Vκ1-6, Vκ1-9, Vκ1-12, Vκ1-13, Vκ1-16, Vκ1-17, Vκ1-22, Vκ1-27, Vκ1-32, Vκ1-33, Vκ1-35, Vκ1-37, Vκ1-39, Vκ1D-8, Vκ1D-12, Vκ1D-13, Vκ1D-16, Vκ1D-17, Vκ1D-22, Vκ1D-27, Vκ1D-32, Vκ1D-33, Vκ1D-35, Vκ1D-37, Vκ1D-39, Vκ1D-42, Vκ1D-43, Vκ1-NL1, Vκ2-4, Vκ2-10, Vκ2-14, Vκ2-18, Vκ2-19, Vκ2-23, Vκ2-24, Vκ2-26, Vκ2-28, Vκ2-29, Vκ2-30, Vκ2-36, Vκ2-38, Vκ2-40, Vκ2D-10, Vκ2D-14, Vκ2D-18, Vκ2D-19, Vκ2D-23, Vκ2D-24, Vκ2D-26, Vκ2D-28, Vκ2D-29, Vκ2D-30, Vκ2D-36, Vκ2D-38, Vκ2D-40, Vκ3-7, Vκ3-11, Vκ3-15, Vκ3-20, Vκ3-25, Vκ3-31, Vκ3-34, Vκ3D-7, Vκ3D-7, VOD-11, Vκ3D-15, Vκ3D-15, Vκ3D-20, Vκ3D-25, Vκ3D-31, Vκ3D-34, Vκ3-NL1, Vκ3-NL2, Vκ3-NL3, Vκ3-NL4, Vκ3-NL5, Vκ4-1, Vκ5-2, Vκ6-21, Vκ6D-21, Vκ6D-41, or Vκ7-3 gene segment sequence, which may be rearranged with a (human) Jκ1, Jκ2, Jκ3, Jκ4, or Jκ5 gene segment, or somatically hypermutated variant thereof. In some embodiments, the light chain variable V_(L/CH×ULC) domain is a V_(λOH×ULC) domain, e.g., is derived from and/or encoded by, a λ light chain variable region nucleotide sequence, e.g., a human λ light chain variable region nucleotide sequence, e.g., a Vλ3-1, Vλ4-3, Vλ2-8, Vλ3-9, Vλ3-10, Vλ2-11, Vλ3-12, Vλ2-14, Vλ3-16, Vλ2-18, Vλ3-19, Vλ3-21, Vλ3-22, Vλ2-23, Vλ3-25, Vλ3-27, Vλ1-36, Vλ5-37, Vλ5-39, Vλ1-40, Vλ7-43, Vλ1-44, Vλ5-45, Vλ7-46, Vλ1-47, Vλ9-49, Vλ1-51, Vλ5-52, Vλ6-57, Vλ4-60, Vλ8-61 or Vλ4-69 gene segment sequence, which may be rearranged with a (human) Jλ1, Jλ2, Jλ3 or Jλ7 gene segment sequence, or a somatically hypermutated variant thereof.

Notably, rearrangement in a hybrid immunoglobulin locus of the unrearranged immunoglobulin V_(L) and J_(L) gene segments may result in a rearranged immunoglobulin light chain variable V_(L/CH×ULC) domain encoding gene sequence comprising 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more N additions. In one embodiment, the N additions and/or the somatic mutations observed in the rearranged immunoglobulin light chain gene encoding a V_(L/CH×ULC) domain are 1.5-fold, 2-fold, 2.5-fold, 3-fold, 3.5-fold, 4-fold, 4.5-fold, or at least 5-fold more than the number of N additions and/or somatic mutations observed in a rearranged light chain variable sequence (derived from the same V_(L) gene segment and the same J_(L) gene segment) that is rearranged at an endogenous light chain locus. The increased N additions in the V_(L/CH×ULC) encoding gene sequence may encode a light chain variable V_(L/CH×ULC) domain having more amino acids in the CDR3 compared to a light chain variable V_(L) domain encoded by a V_(L)/J_(L) gene sequence recombined at an endogenous light chain locus. Accordingly, in some embodiments, an antigen-binding protein provided herein comprises an immunoglobulin light chain variable V_(L/CH×ULC) domain, wherein the variable V_(L/CH×ULC) domain comprises a CDR3 having a length of 9, 10, 11, 12 or more amino acids. In some embodiments, the V_(L/CH×ULC) domain comprises a CDR3 that is 9 amino acids in length. In some embodiments, the V_(L/CH×ULC) domain comprises a CDR3 that is 10 amino acids in length. In some embodiments, the V_(L/CH×ULC) domain comprises a CDR3 that is 11 amino acids in length. In some embodiments, the V_(L/CH×ULC) domain comprises a CDR3 that is 12 amino acids in length.

In preferred embodiments, an antigen-binding protein as described herein is not a single domain binding protein, e.g., is not a heavy chain only binding protein. Accordingly, in some embodiments, a first binding component as described comprises a light chain variable V_(L/CH×ULC) domain fused to a constant region, e.g., a heavy chain constant region comprising at least a functional C_(H)1 domain or a light chain constant domain, wherein the variable V_(L/CH×ULC) domain is derived from a immunoglobulin hybrid chain that (1) comprises a functional C_(H)1 domain and (2) is cognate to a universal light chain, e.g., wherein the immunoglobulin light chain variable V_(L/CH×ULC) domain is a V_(κOH×ULC) or V_(λOH×ULC) domain (respectively encoded by a rearranged (human) Vκ/Jκ or Vλ/Jλ sequence) that is or was operably linked to a heavy chain constant region gene sequence encoding at least a functional C_(H)1 domain, and thus, may comprise a CDR3 that is 9, 10, 11, 12 or more amino acids in length.

Accordingly, in some embodiments, an antigen-binding protein as described herein comprises at least a first binding component comprising an immunoglobulin light chain variable V_(L/CH×ULC) domain fused to a heavy chain constant region comprising at least a functional C_(H)1 domain (e.g., the immunoglobulin light chain V_(L/CH×ULC) domain is fused to a C_(H)1 domain capable of forming a disulfide bond with a light chain constant region) and optionally further comprising a hinge region, a C_(H)2 domain, a C_(H)3 domain, a C_(H)4 domain or a combination thereof. In some embodiments, the heavy chain constant region is a non-human heavy chain constant region comprising at least a functional C_(H)1 domain. In some embodiments, the non-human heavy chain constant region is a rodent (e.g., rat or mouse) or chicken heavy chain constant region comprising at least a functional C_(H)1 domain. In some embodiments, the heavy chain constant region is a human heavy chain constant region comprising at least a functional C_(H)1 domain. In some embodiments, the heavy chain constant region (or C_(H)1 domain) has an isotype selected from the group consisting of IgM, IgD, IgG, IgE and IgA. In some embodiments, the variable V_(L/CH×ULC) domain is fused to an IgG heavy chain constant region (or C_(H)1 domain) having a subclass selected from the group consisting of IgG1, IgG2, IgG3, and IgG4. In some embodiments, the variable V_(L/CH×ULC) domain is fused to a human and mutated IgG1, IgG2, or IgG4 heavy chain constant region (or C_(H)1 domain) comprising a C_(H)3 domain, wherein the mutation is in the C_(H)3 domain of the IgG1, IgG2 or IgG4 heavy chain constant region and reduces or eliminates binding of the C_(H)3 domain to Protein A, e.g., wherein the mutation is selected from the group consisting of (a) 95R, and (b) 95R and 96F in the IMGT numbering system, or (a′) 435R, and (b′) 435R and 436F in the EU numbering system. In some embodiments, the human and mutated heavy chain constant region is a human and mutated IgG1 constant region and, in addition to the (a) 95R or (b) 95R and 96F mutation (in the IMGT numbering system), further comprises one to five modifications selected from the group consisting of 16E, 18M, 44S, 52N, 57M, and 82I in the IMGT exon numbering system, or 356E, 358M, 384S, 392N, 397M, and 422I in the EU numbering system. In some embodiments, the human heavy chain constant region is a human IgG2 constant region and, in addition to the (a) 95R or (b) 95R and 96F mutation (in the IMGT numbering system), further comprises one or two modifications selected from the group consisting of 44S, 52N, 82I in the IMGT exon numbering system, or 348S, 392N and 422I in the EU numbering system. In other embodiments, the human heavy chain constant region is a human IgG4 constant region and, in addition to the (a) 95R or (b) 95R and 96F mutation (in the IMGT numbering system), further comprises one to seven modifications selected from the group consisting of 15R, 44S, 52N, 57M, 69K, 79Q and 82I in the IMGT exon numbering system or 355R, 384S, 392N, 397M, 409K, 419Q and 422I in the EU numbering system and/or the modification 105P in the IGMT exon numbering system or 445P in the EU numbering system.

Additionally, in some embodiments, an antigen-binding protein as described herein comprises at least a first binding component comprising an immunoglobulin light chain variable V_(L/CH×ULC) domain fused to a light chain constant domain. In some embodiments, the light chain constant domain is a non-human light chain constant domain. In some embodiments, the non-human light chain constant domain is a rodent (e.g., rat or mouse) or chicken light chain constant domain. In some embodiments, the light chain constant domain is a human light chain constant domain. In some embodiments, the light chain constant domain is a light chain κ constant domain. In some embodiments, the light chain constant domain is a light chain λ constant domain.

In some embodiments, an immunoglobulin light chain variable V_(L/CH×ULC) (e.g., V_(κOH×ULC) or V_(λOH×ULC)) domain as described herein binds an antigen of interest in the absence of a cognate universal light chain. Accordingly, a first binding component as described herein may consist essentially or consist of the immunoglobulin light chain variable V_(L/CH×ULC) domain or the immunoglobulin light chain variable V_(L/CH×ULC) domain fused to a constant region, e.g., a heavy chain constant region comprising at least a functional C_(H)1 domain or a light chain constant region, wherein the variable V_(L/CH×ULC) domain is derived from a immunoglobulin hybrid chain that is cognate to a universal light chain, e.g., the immunoglobulin light chain variable V_(L/CH×ULC) domain is encoded by a rearranged (human) Vκ/Jκ or Vλ/Jλ sequence that is or was operably linked to a heavy chain constant region gene sequence, and thus, may comprise a CDR3 that is 9, 10, 11, 12 or more amino acids in length.

In other embodiments, the first binding component further comprises a cognate universal light chain variable domain in association with the immunoglobulin light chain V_(L/CH×ULC) variable domain, wherein the variable V_(L/CH×ULC) domain is derived from a immunoglobulin hybrid chain (e.g., the immunoglobulin light chain variable V_(L/CH×ULC) domain is encoded by a rearranged (human) Vκ/Jκ or VλJλ sequence that is or was operably linked to a heavy chain constant region gene sequence, and thus, may comprise a CDR3 that is 9, 10, 11, 12 or more amino acids in length), and wherein the immunoglobulin hybrid chain is cognate to a universal light chain encoded by a rearranged V_(L)/J_(L) gene sequence and wherein the universal light chain variable domain is encoded by the rearranged V_(L)/J_(L) gene sequence or a somatically hypermutated variant thereof.

In some embodiments, the universal light chain variable domain is encoded by or derived from a κ sequence, e.g., a human κ sequence, e.g., a Vκ1-5, Vκ1-6, Vκ1-8, Vκ1-9, Vκ1-12, Vκ1-13, Vκ1-16, Vκ1-17, Vκ1-22, Vκ1-27, Vκ1-32, Vκ1-33, Vκ1-35, Vκ1-37, Vκ1-39, Vκ1D-8, Vκ1D-12, Vκ1D-13, Vκ1D-16, Vκ1D-17, Vκ1D-22, Vκ1D-27, Vκ1D-32, Vκ1D-33, Vκ1D-35, Vκ1D-37, Vκ1D-39, Vκ1D-42, Vκ1D-43, Vκ1-NL1, Vκ2-4, Vκ2-10, Vλ2-14, Vλ2-18, Vκ2-19, Vκ2-23, Vκ2-24, Vκ2-26, Vκ2-28, Vκ2-29, Vκ2-30, Vκ2-36, Vκ2-38, Vκ2-40, Vκ2D-10, Vκ2D-14, Vκ2D-18, Vκ2D-19, Vκ2D-23, Vκ2D-24, Vκ2D-26, Vκ2D-28, Vκ2D-29, Vκ2D-30, Vκ2D-36, Vκ2D-38, Vκ2D-40, Vκ3-7, Vκ3-11, Vκ3-15, Vκ3-20, Vκ3-25, Vκ3-31, Vκ3-34, Vκ3D-7, Vκ3D-7, Vκ3D-11, Vκ3D-15, Vκ3D-15, Vκ3D-20, Vκ3D-25, Vκ3D-31, Vκ3D-34, Vκ3-NL1, Vκ3-NL2, Vκ3-NL3, Vκ3-NL4, Vκ3-NL5, Vκ4-1, Vκ5-2, Vκ6-21, Vκ6D-21, Vκ6D-41, or Vκ7-3 gene segment sequence rearranged with a (human) Jκ1, Jκ2, Jκ3, Jκ4, or Jκ5 gene segment sequence, or a somatically hypermutated variant thereof. In some embodiments, the universal light chain variable domain is encoded by or derived from a nucleotide sequence comprising a human Vκ1-39 gene segment sequence rearranged with a human Jκ5 gene segment sequence, or a somatically hypermutated variant thereof. In some embodiments, the universal light chain variable domain is encoded by or derived from a nucleotide sequence comprising a human Vκ3-20 gene segment sequence rearranged with a human Jκ1 gene segment sequence, or a somatically hypermutated variant thereof. In some embodiments, the universal light chain variable domain is encoded by or derived from a lambda sequence, e.g., a human Vλ3-1, Vλ4-3, Vλ2-8, Vλ3-9, Vλ3-10, Vλ2-11, Vλ3-12, Vλ2-14, Vλ3-16, Vλ2-18, Vλ3-19, Vλ3-21, Vλ3-22, Vλ2-23, Vλ3-25, Vλ3-27, Vλ1-36, Vλ5-37, Vλ5-39, Vλ1-40, Vλ7-43, Vλ5-45, Vλ7-46, Vλ1-47, Vλ9-49, Vλ1-51, Vλ5-52, Vλ6-57, Vλ4-60, Vλ8-61 or Vλ4-69 gene segment sequence rearranged with a (human) Jλ1, Jλ2, Jλ3 or Jλ7 gene segment sequence, or a somatically hypermutated variant thereof. In some embodiments the universal light chain variable domain is encoded by or derived from a nucleotide sequence comprising a Vκ2-14 gene segment sequence rearranged with a 0.120 gene segment sequence, or a somatically hypermutated variant thereof. In some embodiments the universal light chain variable domain is encoded by or derived from a nucleotide sequence comprising a Vλ2-14 gene segment sequence rearranged with a Jλ7 gene segment sequence, or a somatically hypermutated variant thereof.

A first binding component as described herein may comprise the immunoglobulin light chain variable V_(L/CH×ULC) (e.g., V_(κOH×ULC) or V_(λOH×ULC)) domain and the cognate universal light chain variable domain associated, e.g., linked, by a disulfide bond or a peptide linker. In some embodiments, a first binding component as described herein comprises a V_(L/CH×ULC) variable domain linked to a cognate universal light chain variable domain via a peptide linker, e.g., in an scFv-type format. In some embodiments, a first binding component as described herein comprises (i) an immunoglobulin light chain variable V_(L/CH×ULC) (e.g., V_(κOH×ULC) or V_(λOH×ULC)) domain fused to a heavy chain constant region comprising at least a functional C_(H)1 domain, e.g., the immunoglobulin light chain variable V_(L/CH×ULC) (e.g., V_(κOH×ULC) or V_(λOH×ULC)) domain fused to the functional C_(H)1 domain and (ii) a universal light chain variable domain fused to a light chain constant domain, wherein the C_(H)1 domain is linked to the light chain constant domain by a disulfide bond or a peptide linker. In one embodiment, the V_(L/CH×ULC) variable domain is fused to heavy chain constant region comprising at least a functional C_(H)1 domain (and optionally further comprising a hinge region, a C_(H)2 domain, a C_(H)3 domain, a C_(H)4 domain, or a combination thereof), the universal light chain variable domain is fused to a light chain constant domain, and the functional C_(H)1 domain is linked to the light chain constant domain by a disulfide bond. In another embodiment, the first binding component is in an scFab format, e.g., the V_(L/CH×ULC) variable domain is fused to heavy chain constant region comprising at least a functional C_(H)1 domain, the universal light chain variable domain is fused to a light chain constant domain, and the functional C_(H)1 domain is linked to the light chain constant domain by a peptide linker.

In some embodiments, a first binding component comprises a human V_(L/CH×ULC) (e.g., a human V_(κOH×ULC) or a human V_(κOH×ULC)) domain, optionally fused with a human heavy chain comprising a C_(H)1 domain or a human light chain constant domain. In one embodiment, an antigen-binding protein provided herein consists essentially or consists of only a first binding component as described herein, wherein the first binding component binds an antigen of interest.

Also provided are an antigen-binding proteins that, in addition to comprising a first binding component comprising a V_(L/CH×ULC) (e.g., V_(κOH×ULC) or V_(λOH×ULC)) variable domain as described herein, further comprise a second binding component that comprises a second immunoglobulin variable domain that is derived from a heavy or hybrid chain, wherein both the V_(L/CH×ULC) variable domain of the first component and the second variable domain of second binding component may be, and preferably are, cognate to a universal light chain variable domain derived from, e.g., encoded by, an identical rearranged light chain variable region gene sequence. As such, any differences in the universal light chain variable domains to which the V_(L/CH×ULC) and second variable domains are respectively cognate are the result of somatic hypermutation(s), e.g., may be determined to have arisen from somatic hypermutation or affinity maturation processes.

An antigen-binding protein as provided herein may comprise first and second binding components as described herein, wherein the first and second binding components comprise identical V_(L/CH×ULC) variable domains, and wherein the antigen-binding protein is monospecific, e.g., may specifically bind a single epitope of interest.

In some embodiments, the first and second binding components are not identical, e.g., bind different epitopes, which may be on the same antigen or may be on different antigens. Accordingly, an antigen-binding protein as described herein may be a multi-specific antigen-binding protein and comprise a (i) first binding component comprising a first variable domain, e.g., a V_(L/CH×ULC) domain (e.g., V_(κOH×ULC) or V_(λOH×ULC)), specific for a first epitope; and (ii) a second binding component comprising a second variable domain specific for a second epitope, wherein the second variable domain is either a second V_(L/CH×ULC) domain or a V_(H×ULC) domain (a heavy chain variable domain derived from a heavy chain encoded by V_(H)DJ_(H) gene sequence operably linked to a heavy chain constant region gene, wherein the heavy chain variable domain is cognate to a universal light chain variable domain), wherein the first and second epitopes are not identical, and wherein the first and second variable domains are each cognate to universal light chain variable domains that are derived from the same single rearranged light chain variable region gene sequence, and thus, are identical or are somatically hypermutated variants, e.g., differ in amino acid sequence only through somatic hypermutation. In some embodiments, the second variable domain is a second V_(L/CH×ULC) (e.g., V_(κOH×ULC) or V_(λOH×ULC)) domain that binds a second epitope that is different than the first epitope, although the first and second variable V_(L/CH×ULC) domains are cognate to universal light chain variable domains that are derived from the same single rearranged light chain variable region gene sequence, and thus, are identical or somatically hypermutated variants. In some embodiments, the second variable domain is an immunoglobulin heavy chain variable (V_(H×ULC)) domain that binds a second epitope different than the first epitope, wherein the V_(L/CH×ULC) variable domain of the first binding component and the V_(H×ULC) variable domain of the second binding component are cognate to universal light chain variable domains that are derived from same single rearranged light chain variable region gene sequence, and thus, are identical or somatically hypermutated variants. A multispecific antigen-binding protein provided herein comprising first and second binding components that are not identical may specifically bind more than one epitope of interest.

In some embodiments, wherein the second binding component comprises a second variable domain that is a V_(H×ULC) domain, the V_(H) domain is encoded by a heavy chain variable region nucleotide sequence, e.g., a human heavy chain variable region nucleotide sequence, e.g., any human V_(H), D, and J_(H) gene segment sequence present in the human repertoire, e.g., any human heavy chain variable gene segments described in IMGT database, www.imgt.org, or somatically hypermutated variants thereof.

Additionally, the first binding component and the second binding component may be associated by one or more peptide linkers, one or more disulfide bonds and/or one or more leucine zippers such that a multiple specific antigen-binding protein provided herein is in a form selected from the group consisting of a Fab-like structure, an scFab-like structure, a diabody-like structure, an scFv-like structure, an scFv-Fc like structure, an scFv-zipper like structure, or a tetrameric structure that is similar to a typical antibody that includes the cognate universal light chain. Accordingly, in some embodiments, either or both the V_(L/CH×ULC) (e.g., V_(κOH×ULC) or V_(λOH×ULC)) and second variable domains may be or may not be fused to a constant region (e.g., a heavy chain constant region comprising a functional C_(H)1 domain or a light chain constant domain) and/or may or may not be associated with a cognate universal light chain variable domain.

In some embodiments, the variable V_(L/CH×ULC) (e.g., V_(κOH×ULC) or V_(λOH×ULC)) domain of the first component is linked to the second variable (V_(L/CH×ULC) or V_(H×ULC)) domain of second binding component by a peptide linker such that the antigen-binding protein may have a diabody-like structure, an scFv-like structure, an scFv-Fc like structure, or an scFv-zipper like structure. In some embodiments, (1) at least one of (a) the V_(L/CH×ULC) variable domain of the first component or (b) the second variable domain of second binding component is fused to a (non-human or human) heavy chain constant region comprising at least a functional C_(H)1 domain (and optionally further comprising a hinge region, a C_(H)2 domain, a C_(H)3 domain, a C_(H)4 domain, or a combination thereof) and (2) the other of (a) the V_(L/CH×ULC) variable domain of the first component or (b) the second variable domain of second binding component is fused to a (non-human or human) light chain constant (C_(L)) domain, wherein the C_(H)1 domain is linked to the C_(L) domain by a disulfide bond such that the antigen-binding protein has a Fab-like structure, or wherein the C_(H)1 domain is linked to the C_(L) domain by a peptide linker such that the antigen-binding protein may have a scFab-like structure.

In some embodiments, both the first V_(L/CH×ULC) (e.g., V_(κOH×ULC) or V_(λOH×ULC)) and second (V_(L/CH×ULC) or V_(H×ULC)) variable domains are respectively fused to a first and second heavy chain constant regions, wherein each of the first and second heavy chain constant regions respectively comprises a first functional C_(H)1 domain and a second functional C_(H)1 domain (each heavy chain constant region optionally further comprising a hinge region, a C_(H)2 domain, a C_(H)3 domain, a C_(H)4 domain, or a combination thereof), and wherein the first and second heavy chain constant regions are linked, e.g., by a disulfide bond or a peptide linker. In some embodiments, at least one (or both) of the heavy chain constant regions is a non-human heavy chain constant region, e.g., a rodent (e.g., rat or mouse) or chicken heavy chain constant region. In some embodiments, at least one (or both) of the heavy chain constant regions is a human heavy chain constant region. In some embodiments, at least one (or both) of the heavy chain constant regions has an isotype selected from the group consisting of IgM, IgD, IgG, IgE and IgA. In some embodiments, at least one (or both) of the first variable V_(L/CH×ULC) and the second variable domains is fused to an IgG heavy chain constant region having a subclass selected from the group consisting of IgG1, IgG2, IgG3, and IgG4. In some embodiments, the first variable V_(L/CH×ULC) and the second variable domains are fused to heavy chain constant regions having an identical isotype and/or subclass, but optionally, wherein the heavy chain constant regions differ in their affinity to Protein A. In some embodiments, wherein both the first variable V_(L/CH×ULC) and the second variable domains are fused to a human IgG1, IgG2, or IgG4 heavy chain constant region, only one of the first variable V_(L/CH×ULC) and the second variable domain is fused to a human IgG1, IgG2, or IgG4 heavy chain constant region comprising a mutation in the C_(H)3 domain that reduces or eliminates binding of the C_(H)3 domain to Protein A, e.g., a mutation selected from the group consisting of (a) 95R, and (b) 95R and 96F in the IMGT numbering system, or (a′) 435R, and (b′) 435R and 436F in the EU numbering system. In some embodiments, the human heavy chain constant region is a human IgG1 constant region and further comprises one to five modifications selected from the group consisting of 16E, 18M, 44S, 52N, 57M, and 82I in the IMGT exon numbering system, or 356E, 358M, 384S, 392N, 397M, and 422I in the EU numbering system. In some embodiments, the human heavy chain constant region is a human IgG2 constant region and further comprises one or two modifications selected from the group consisting of 44S, 52N, 82I in the IMGT exon numbering system, or 348S, 392N and 422I in the EU numbering system. In other embodiments, the human heavy chain constant region is a human IgG4 constant region and further comprises one to seven modifications selected from the group consisting of 15R, 44S, 52N, 57M, 69K, 79Q and 82I in the IMGT exon numbering system or 355R, 384S, 392N, 397M, 409K, 419Q and 422I in the EU numbering system and/or the modification 105P in the IGMT exon numbering system or 445P in the EU numbering system.

In some embodiments, the first and second binding components may each respectively further comprise a first and second universal light chain variable domain respectively fused to a first and second light chain constant (C_(L)) domain, wherein the first and second C_(L) domains are respectively linked, e.g., by a disulfide bond, to the first and second C_(H)1 domains of the first and second heavy chain constant regions, wherein the first and second universal light chain variable domains are derived from same single rearranged light chain variable region gene sequence, and thus, are identical or somatically hypermutated variants.

In some embodiments, an antigen-binding protein as described herein comprises a first binding component comprising a human V_(L/CH×ULC) (V_(κOH×ULC) or V_(κOH×ULC)) domain, optionally fused with a human heavy chain comprising at least a C_(H)1 domain or a human light chain constant domain, a second binding component comprising a second human V_(L/CH×ULC) or a human V_(H×ULC) domain, optionally fused with a human heavy chain comprising a C_(H)1 domain or a human light chain constant domain, and optionally, a human universal light chain comprising a human universal light chain variable domain fused with a human light chain constant domain.

Non-human animals include, e.g., mammals and, in particular embodiments, rodents (e.g., mice, rats, or hamsters). In some embodiments, non-human animals include birds, e.g., chickens. The present invention provides non-human animals engineered to contain (e.g., in their germline genome and/or in genomes of their B cells) nucleic acid sequences as described herein and/or to express antigen-binding proteins (e.g., immunoglobulin chains and/or antibodies) as described herein, are provided by the present invention.

In some embodiments, the present invention particularly encompasses the recognition that it is desirable to engineer non-human animals to provide improved in vivo systems for the generation of immunoglobulin light chain domains in which antigen specificity and affinity is dominated by (e.g., results solely or primarily from, and/or resides solely or primarily in), immunoglobulin light chain variable domain diversity. In some embodiments, the present invention encompasses the recognition that it is desirable to engineer non-human animals to permit improved in vivo affinity maturation and/or selection for immunoglobulin light chain variable domains that bind antigen independent from an immunoglobulin heavy chain variable domain. In some embodiments, the present invention encompasses the recognition that non-human animals whose genome comprises unrearranged human light chain variable region gene segments operably linked to a heavy chain constant region and a rearranged human light chain variable region nucleic acid sequence are desirable, for example for use in selection immunoglobulin light chain variable domains (V_(κOH×ULC) or V_(λOH×ULC)) having some or all of the aforementioned characteristics.

In some embodiments, the present invention provides a non-human animal capable of generating a V_(L/CH×ULC) variable domain, wherein the non-human animal comprises in its germline genome (a) a first hybrid immunoglobulin locus, e.g., at an endogenous non-human heavy chain locus, comprising unrearranged (human) immunoglobulin light chain (V_(L) and J_(L)) gene segments capable of rearranging to form a rearranged (human) V_(L)/J_(L) gene sequence operably linked to an immunoglobulin heavy chain constant region nucleic acid sequence comprising one or more heavy chain constant region genes each one comprising a sequence encoding a functional C_(H)1 domain, e.g., an intact Igμ, gene and at least one of an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and an intact Igα gene, wherein the rearranged human V_(L)/J_(L) gene sequence operably linked to an immunoglobulin heavy chain constant region nucleic acid sequence encodes a hybrid immunoglobulin chain; and (b) a second light chain immunoglobulin locus, e.g., at an endogenous non-human light chain locus, comprising a rearranged human immunoglobulin light chain variable region nucleotide sequence operably linked to an immunoglobulin light chain constant region nucleic acid sequence, wherein the rearranged human immunoglobulin light chain variable region nucleotide sequence operably linked to an immunoglobulin light chain constant region nucleic acid sequence encodes a universal light chain, and wherein the non-human animal is capable of producing or does produce a cell, e.g., a lymphocyte, e.g., a B cell that expresses an antigen-binding protein comprising the immunoglobulin hybrid chain and the universal light chain, and wherein an immunoglobulin light chain variable domain of the immunoglobulin hybrid chain is a V_(L/CH×ULC) domain. In some embodiments, the non-human animal is a mammal or a bird. In some certain embodiments, the bird is a chicken. In some certain embodiments, the mammal is a rodent. In some embodiments, the rodent is selected from the group consisting of a mouse, a rat, and a hamster.

In some embodiments, a non-human animal of the present invention is homozygous for the rearranged human immunoglobulin light chain variable region nucleotide sequence. In some embodiments, a non-human animal of the present invention is heterozygous for the rearranged human immunoglobulin light chain variable region nucleotide sequence. In some embodiments, a non-human animal of the present invention is homozygous for the hybrid immunoglobulin locus. In some embodiments, a non-human animal of the present invention is heterozygous for the hybrid immunoglobulin locus. In some embodiments, the unrearranged (human) immunoglobulin light chain variable gene segments are operably linked to a non-human heavy chain constant region nucleic acid sequence comprising one or more heavy chain constant region genes each one comprising a sequence encoding a functional C_(H)1 domain, e.g., an intact Igμ gene and at least one of an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and an intact Igα gene. In some embodiments, the non-human heavy chain constant region nucleic acid sequence is a mouse, rat, or chicken heavy chain constant region nucleic acid sequence comprising one or more heavy chain constant region genes each one comprising a sequence encoding a functional C_(H)1 domain, e.g., an intact Igμ gene and at least one of an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and an intact Igα gene. In some embodiments, the non-human animal is a rodent, and the unrearranged human immunoglobulin light chain variable gene segments are operably linked to a human heavy chain constant region nucleic acid sequence comprising one or more heavy chain constant region genes each one comprising a sequence encoding a functional C_(H)1 domain, e.g., an intact Igμ gene and at least one of an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and an intact Igα gene.

In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is operably linked to a non-human light chain constant region nucleic acid sequence. In some certain embodiments, the non-human light chain constant region nucleic acid sequence is a mouse or a rat light chain constant region nucleic acid sequence. In some embodiments, the non-human animal is a rodent, and the rearranged human immunoglobulin light chain variable region nucleotide sequence is operably linked to a human light chain constant region nucleic acid sequence.

In some certain embodiments, the light chain constant region nucleic acid sequence is a kappa sequence. In some certain embodiments, the light chain constant region nucleic acid sequence is a lambda sequence.

In some embodiments, the second immunoglobulin locus is a light chain kappa locus. In some embodiments, the second immunoglobulin locus is a light chain lambda locus.

In some embodiments, the unrearranged human immunoglobulin V_(L) and J_(L) gene segments are Vκ and Jκ gene segments. In some embodiments, the unrearranged human immunoglobulin V_(L) and J_(L) gene segments are Vλ and Jλ gene segments.

In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises a human κ light chain variable region nucleotide sequence. In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises a human λ light chain variable region nucleotide sequence.

In some embodiments, the first locus comprises one or more unrearranged human immunoglobulin V_(L) gene segments selected from the group consisting of Vκ1-5, Vκ1-6, Vκ1-8, Vκ1-9, Vκ1-12, Vκ1-13, Vκ1-16, Vκ1-17, Vκ1-22, Vκ1-27, Vκ1-32, Vκ1-33, Vκ1-35, Vκ1-37, Vκ1-39, Vκ1D-8, Vκ1D-12, Vκ1D-13, Vκ1D-16, Vκ1D-17, Vκ1D-22, Vκ1D-27, Vκ1D-32, Vκ1D-33, Vκ1D-35, Vκ1D-37, Vκ1D-39, Vκ1D-42, Vκ1D-43, Vκ1-NL1, Vκ2-4, Vκ2-10, Vκ2-14, Vκ2-18, Vκ2-19, Vκ2-23, Vκ2-24, Vκ2-26, Vκ2-28, Vκ2-29, Vκ2-30, Vκ2-36, Vκ2-38, Vκ2-40, Vκ2D-10, Vκ2D-14, Vκ2D-18, Vκ2D-19, Vκ2D-23, Vκ2D-24, Vκ2D-26, Vκ2D-28, Vκ2D-29, Vκ2D-30, Vκ2D-38, Vκ2D-40, Vκ3-7, Vii-11, Vκ3-15, Vκ3-20, Vκ3-34, Vκ3D-7, Vκ3D-7, Vκ3D-11, Vκ3D-15, Vκ3D-15, Vκ3D-20, Vκ3D-25, Vκ3D-31, Vκ3D-34, Vκ3-NL1, Vκ3-NL2, Vκ3-NL3, Vκ3-NL4, Vκ3-NL5, Vκ4-1, Vκ5-2, Vκ6-21, Vκ6D-21, Vκ6D-41, and Vκ7-3. In some embodiments, the first locus comprises unrearranged human immunoglobulin J_(L) gene segments that include Jκ1, Jκ2, Jκ3, Jκ4, and Jκ5.

In some embodiments, the V_(κ) gene segment in the rearranged human immunoglobulin light chain variable region nucleotide sequence is a (human germline) Vκ gene segment selected from the group consisting of Vκ1-5, Vκ1-6, Vκ1-8, Vκ1-9, Vκ1-12, Vκ1-13, Vκ1-16, Vκ1-17, Vκ1-22, Vκ1-27, Vκ1-33, Vκ1-35, Vκ1-37, Vκ1-39, Vκ1D-8, Vκ1D-12, Vκ1D-13, Vκ1D-16, Vκ1D-17, Vκ1D-22, Vκ1D-27, Vκ1D-32, Vκ1D-33, Vκ1D-35, Vκ1D-37, Vκ1D-39, Vκ1D-42, Vκ1D-43, Vκ1-NL1, Vκ2-4, Vκ2-10, Vκ2-14, Vκ2-18, Vκ2-19, Vκ2-23, Vκ2-24, Vκ2-26, Vκ2-28, Vκ2-29, Vκ2-30, Vκ2-36, Vκ2-38, Vκ2-40, Vκ2D-10, Vκ2D-14, Vκ2D-18, Vκ2D-19, Vκ2D-23, Vκ2D-24, Vκ2D-26, Vκ2D-28, Vκ2D-29, Vκ2D-30, Vκ2D-36, Vκ2D-38, Vκ2D-40, Vκ3-7, Vκ3-11, Vκ3-15, Vκ3-20, Vκ3-25, Vκ3-31, Vκ3-34, Vκ3D-7, Vκ3D-7, Vκ3D-11, Vκ3D-15, Vκ3D-15, Vκ3D-20, Vκ3D-25, Vκ3D-31, Vκ3D-34, Vκ3-NL1, Vκ3-NL2, Vκ3-NL3, Vκ3-NL4, Vκ3-NL5, Vκ4-1, Vκ5-2, Vκ6-21, Vκ6D-21, Vκ6D-41, and Vκ7-3. In some certain embodiments, the V_(L) gene segment is selected from the group consisting of Vκ1-39 and Vκ3-20. In some embodiments, the V_(K) gene segment is selected from the group consisting of a human germline Vκ1-39 gene segment and a human germline Vκ3-20 gene segment.

In some embodiments, the J_(L) gene segment in the rearranged human immunoglobulin light chain variable region nucleotide sequence is selected from the group consisting of lid, Jκ2, Jκ3, Jκ4, and Jκ5, e.g., a human germline Jκ1 gene segment, a human germline Jκ2 gene segment, a human germline Jκ3 gene segment, a human germline Jκ4 gene segment, and a human germline Jκ5 gene segment.

In some certain embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises Vκ1-39 and Jκ5. In some certain embodiments, the Vκ1-39 is rearranged with the Jκ5. In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is set forth as SEQ ID NO:1. In some certain embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises Vκ3-20 and Jκ1. In some certain embodiments, the Vκ3-20 is rearranged with Jκ1. In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is set forth as SEQ ID NO:2.

In some embodiments the non-human animal is a rodent, and wherein the light chain constant region nucleic acid sequence is a rat or a mouse Cκ constant region nucleic acid sequence.

In some embodiments, the non-human animal is a rodent, and wherein the heavy chain constant region nucleic acid sequence is a rat or mouse constant region sequence selected from the group consisting of Igμ, Igδ, Igγ, Igε, Igα, each of which encodes a functional C_(H)1 domain.

In some embodiments, the first locus comprises one or more unrearranged human immunoglobulin V_(L) gene segments selected from the group consisting of Vλ3-1, Vλ4-Vλ2-8, Vλ3-9, Vλ3-10, Vλ2-11, Vλ3-12, Vλ2-14, Vλ3-16, Vλ2-18, Vλ3-19, Vλ3-21, Vλ3-22, Vλ2-23, Vλ3-25, Vλ3-27, Vλ1-36, Vλ5-37, Vλ5-39, Vλ1-40, Vλ7-43, Vλ1-44, Vλ5-45, Vλ7-46, Vλ9-49, Vλ1-51, Vλ5-52, Vλ6-57, Vλ4-60, VAS-61 and Vλ4-69.

In some embodiments, the first locus comprises unrearranged human immunoglobulin J_(L) gene segment that include Jλ1, Jλ2, Jλ3 and Jλ7.

In some embodiments, the V_(L) gene segment in the rearranged human immunoglobulin light chain variable region nucleotide sequence is a (human germline) Vλ gene segment selected from the group consisting of Vλ3-1, Vλ4-3, Vλ2-8, Vλ3-9, Vλ3-10, Vλ2-11, Vλ3-12, Vλ2-14, Vλ3-16, Vλ2-18, Vλ3-19, Vλ3-21, Vλ3-22, Vλ2-23, Vλ3-25, Vλ3-27, Vλ5-37, Vλ5-39, Vλ1-40, Vλ7-43, Vλ1-44, Vλ5-45, Vλ7-46, Vλ9-49, Vλ5-52, Vλ6-57, Vλ4-60, Vλ8-61 and Vλ4-69. In some certain embodiments the V_(L) gene segment is Vλ2-14, e.g., a human germline Vλ2-14 gene segment.

In some embodiments, the J_(L) gene segment in the rearranged human immunoglobulin light chain variable region nucleotide sequence is selected from the group consisting of Jλ1, Jλ2, Jλ3 and Jλ7.

In some certain embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises Vλ2-14Jλ1. In some certain embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is Vλ2-14Jλ2. In some certain embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is Vλ2-14Jλ3. In some certain embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is Vλ2-14Jλ7. In some embodiments, the non-human animal is a rodent, and wherein the light chain constant region nucleic acid sequence is a rat or a mouse Cλ constant region nucleic acid sequence.

In some embodiments, the non-human animal is a rodent, and wherein the heavy chain constant region nucleic acid sequence is a rat or mouse constant region sequence selected from the group consisting of Igμ, Igδ, Igγ, Igε, Igα, and a combination thereof, each of which encodes a functional C_(H)1 domain.

In some embodiments, substantially all endogenous functional variable heavy chain V_(H), D, and J_(H) gene segments are deleted from an endogenous immunoglobulin heavy chain locus of the non-human animal or rendered non-functional. In some embodiments, substantially all endogenous functional light chain V_(L) and J_(L) gene segments are deleted from an endogenous immunoglobulin light chain locus of the non-human animal or rendered non-functional.

In some embodiments, the non-human animal comprises an integrated Adam6a gene, an Adam6b gene, or both. In some embodiments, a non-human animal comprises a functional ectopic mouse Adam6 gene.

In some embodiments, the first immunoglobulin locus comprises a plurality of copies of the rearranged human immunoglobulin light chain variable nucleotide sequence.

In some embodiments, the present invention provides a method of making a non-human animal, the method generally comprising modifying a germline genome of the non-human animal to comprise (i) a rearranged human immunoglobulin light chain variable region nucleotide sequence operably linked to an immunoglobulin light chain constant region nucleic acid sequence; and (ii) unrearranged human immunoglobulin light chain variable region gene segments (V_(L) and J_(L)) capable of rearranging to form a rearranged human V_(L)/J_(L) gene sequence operably linked to an immunoglobulin heavy chain constant region nucleic acid sequence. In some embodiments, the method comprises (a) modifying a genome of a non-human animal to delete or render non-functional all or substantially all (i) endogenous functional immunoglobulin heavy chain V_(H), D, and J_(H) gene segments and (ii) endogenous functional light chain V_(L) and J_(L) gene segments; (b) placing unrearranged human immunoglobulin V_(L) and J_(L) gene segments in the genome so that the unrearranged light chain variable gene segments are operably linked to a heavy chain constant region nucleic acid sequence; and (c) placing a rearranged human immunoglobulin light chain variable region nucleotide sequence in the genome so that the rearranged human immunoglobulin light chain variable region nucleotide sequence is operably linked to a light chain constant region nucleic acid sequence. In some embodiments, the method comprises (a) replacing all endogenous functional immunoglobulin heavy chain V_(H), D, and J_(H) gene segments at an endogenous heavy chain locus with unrearranged human immunoglobulin V_(L) and J_(L) gene segments so that the unrearranged light chain variable gene segments are operably linked to an endogenous heavy chain constant region nucleic acid sequence, and (b) replacing all endogenous functional light chain V_(L) and J_(L) gene segments at an endogenous light chain locus with a rearranged human immunoglobulin light chain variable region nucleotide sequence so that the rearranged human immunoglobulin light chain variable region nucleotide sequence is operably linked to an endogenous light chain constant region nucleic acid sequence.

In some embodiments, the non-human animal is a mammal or a bird. In some certain embodiments, the bird is a chicken. In some certain embodiments, the mammal is a rodent. In some embodiments, the rodent is selected from the group consisting of a mouse, a rat, and a hamster.

In some embodiments, the unrearranged human immunoglobulin V_(L) and J_(L) gene segments are operably linked to a non-human immunoglobulin heavy chain constant region nucleic acid sequence. In some embodiments, the non-human immunoglobulin heavy chain constant region nucleic acid sequence is a mouse or rat immunoglobulin heavy chain constant region nucleic acid sequence. In some embodiments, the non-human animal is a rodent, and the unrearranged human immunoglobulin light chain variable V_(L) and J_(L) gene segments are operably linked to a human heavy chain constant region nucleic acid sequence.

In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is operably linked to a non-human light chain constant region nucleic acid sequence. In some embodiments, the non-human light chain constant region nucleic acid sequence is a mouse or a rat light chain constant region nucleic acid sequence. In some embodiments, the non-human animal is a rodent, and the rearranged human immunoglobulin light chain variable region nucleotide sequence is operably linked to a human light chain constant region nucleic acid sequence. In some certain embodiments, the light chain constant region nucleic acid sequence is a kappa sequence. In some certain embodiments, the light chain constant region nucleic acid sequence is a lambda sequence. In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is placed in a kappa light chain locus. In some certain embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is placed in a lambda light chain locus.

In some certain embodiments, the unrearranged human immunoglobulin V_(L) and J_(L) gene segments are Vκ and Jκ gene segments. In some certain embodiments, the unrearranged human immunoglobulin V_(L) and J_(L) gene segments are Vλ and Jλ gene segments.

In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises a human κ light chain variable region nucleotide sequence. In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises a human λ light chain variable region nucleotide sequence.

In some embodiments, the unrearranged human immunoglobulin V_(L) gene segments include one or more of Vκ1-5, Vκ1-6, Vκ1-8, Vκ1-9, Vκ1-12, Vκ1-13, Vκ1-16, Vκ1-17, Vκ1-22, Vκ1-27, Vκ1-32, Vκ1-33, Vκ1-35, Vκ1-37, Vκ1-39, Vκ1D-8, Vκ1D-12, Vκ1D-13, Vκ1D-16, Vκ1D-17, Vκ1D-22, Vκ1D-27, Vκ1D-32, Vκ1D-33, Vκ1D-35, Vκ1D-37, Vκ1D-39, Vκ1D-42, Vκ1D-43, Vκ1-NL1, Vκ2-4, Vκ2-10, Vκ2-14, Vκ2-18, Vκ2-19, Vκ2-23, Vκ2-24, Vκ2-26, Vκ2-28, Vκ2-29, Vκ2-30, Vκ2-36, Vκ2-38, Vκ2-40, Vκ2D-10, Vκ2D-14, Vκ2D-18, Vκ2D-19, Vκ2D-23, Vκ2D-24, Vκ2D-26, Vκ2D-28, Vκ2D-29, Vκ2D-30, Vκ2D-36, Vκ2D-38, Vκ2D-40, Vκ3-7, Vκ3-11, Vκ3-15, Vκ3-20, Vκ3-25, Vκ3-31, Vκ3- 34, Vκ3D-7, Vκ3D-7, Vκ3D-11, Vκ3D-15, Vκ3D-15, Vκ3D-20, Vκ3D-25, Vκ3D-31, Vκ3D-34, Vκ3-NL1, Vκ3-NL2, Vκ3-NL3, Vκ3-NL4, Vκ3-NL5, Vκ4-1, Vκ5-2, Vκ6-21, Vκ6D-21, Vκ6D-41, and Vκ7-3. In some embodiments, the unrearranged human immunoglobulin J_(L) gene segments include Jκ1, Jκ2, Jκ3, Jκ4, and Jκ5.

In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises V_(L) and J_(L) gene segments. In some embodiments, the V_(L) gene segment in the rearranged human immunoglobulin light chain variable region is a (human germline) W gene segment selected from the group consisting of Vκ1-5, Vκ1-6, Vκ1-8, Vκ1-12, Vκ1-13, Vκ1-16, Vκ1-17, Vκ1-22, Vκ1-27, Vκ1-32, Vκ1-33, Vκ1-35, Vκ1-37, Vκ1-39, Vκ1D-8, Vκ1D-12, Vκ1D-13, Vκ1D-16, Vκ1D-17, Vκ1D-22, Vκ1D-27, Vκ1D-32, Vκ1D-33, Vκ1D-35, Vκ1D-37, Vκ1D-39, Vκ1D-42, Vκ1D-43, Vκ1-NL1, Vκ2-4, Vκ2-10, Vκ2-14, Vκ2-18, Vκ2-19, Vκ2-23, Vκ2-24, Vκ2-26, Vκ2-28, Vκ2-29, Vκ2-30, Vκ2-36, Vκ2-38, Vκ2-40, Vκ2D-10, Vκ2D-14, Vκ2D-18, Vκ2D-19, Vκ2D-23, Vκ2D-24, Vκ2D-26, Vκ2D-28, Vκ2D-29, Vκ2D-30, Vκ2D-36, Vκ2D-38, Vκ2D-40, Vκ3-7, Vκ3-11, Vκ3-15, Vκ3-20, Vκ3-25, Vκ3-31, Vκ3-34, Vκ3D-7, Vκ3D-7, Vκ3D-11, Vκ3D-15, Vκ3D-15, Vκ3D-20, Vκ3D-25, MOD-31, MOD-34, Vκ3-NL1, Vκ3-NL2, Vκ3-NL3, Vκ3-NL4, Vκ3-NL5, Vκ4-1, Vκ5-2, Vκ6-21, Vκ3D-21, Vκ3D-41, and Vκ7-3. In some certain embodiments, the V_(L) gene segment is selected from the group consisting of Vκ1-39 and Vκ3-20. In some embodiments, the Vκ gene segment is a human germline Vκ gene segment, e.g., a human germline Vκ1-39 gene segment or a human germline Vκ3-20 gene segment. In some embodiments, the J_(L) gene segment is selected from the group consisting of Jκ1, Jκ2, Jκ3, Jκ4, and Jκ5, e.g., the group consisting of a human germline Jκ1 gene segment, a human germline Jκ2 gene segment a human germline Jκ3 gene segment a human germline Jκ4 gene segment and a human germline Jκ5 gene segment. In some certain embodiments, the rearranged human immunoglobulin light chain variable nucleotide sequence comprises Vκ1-39 and Jκ5 (e.g., the Vκ1-39 is rearranged with the Jκ5). In some embodiments, the rearranged immunoglobulin light chain variable region nucleotide sequence comprises the sequence set forth as SEQ ID NO:1. In some certain embodiments, the rearranged human immunoglobulin light chain variable nucleotide sequence comprises Vκ3-20 and Jκ1 (e.g., the Vκ3-20 is rearranged with the Jκ1). In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises the sequence set forth as SEQ ID NO:2.

In some embodiments, the non-human animal is a rodent, and wherein the light chain constant region nucleic acid sequence is a rat or a mouse Cκ constant region nucleic acid sequence. In some embodiments, the non-human animal is a rodent, and wherein the heavy chain constant region nucleic acid sequence is a rat or mouse constant region sequence selected from the group consisting of Igμ, Igδ, Igγ, Igε, Igα, and a combination thereof, each of which encodes a functional C_(H)1 domain.

In some embodiments, the non-human animal comprises one or more unrearranged human immunoglobulin V_(L) gene segments selected from the group consisting of Vλ3-1, Vλ4-3, Vλ2-8, Vλ3-9, Vλ3-10, Vλ2-11, Vλ3-12, Vλ2-14, Vλ3-16, Vλ2-18, Vλ3-19, Vλ3-21, Vλ3-22, Vλ2-23, Vλ3-25, Vλ3-27, Vλ1-36, Vλ5-37, Vλ5-39, Vλ1-40, Vλ7-43, Vλ1-44, Vλ5-45, Vλ7-46, Vλ1-47, Vλ9-49, Vλ1-51, Vλ5-52, Vλ6-57, Vλ4-60, Vλ8-61 and Vλ4-69. In some embodiments, the non-human animal comprises unrearranged human immunoglobulin J_(L) gene segments that include Jλ1, Jλ2, Jλ3 and Jλ7.

In some embodiments, the V_(L) gene segment in the rearranged human immunoglobulin light chain variable region nucleotide sequence is a (human germline) Vλ gene segment selected from the group consisting of Vλ3-1, Vλ4-3, Vλ2-8, Vλ3-9, Vλ3-10, Vλ2-11, Vλ3-12, Vλ2-14, Vλ3-16, Vλ2-18, Vλ3-19, Vλ3-21, Vλ3-22, Vλ2-23, Vλ3-25, Vλ3-27, Vλ1-36, Vλ5-37, Vλ5-39, Vλ1-40, Vλ7-43, Vλ1-44, Vλ5-45, Vλ7-46, Vλ1-47, Vλ9-49, Vλ1-51, Vλ5-52, Vλ6-57, Vλ4-60, Vλ8-61 and Vλ4-69. In some certain embodiments, the V_(L) gene segment is Vλ2-14, e.g., a human germline Vλ2-14 gene segment. In some embodiments, the J_(L) gene segment in the rearranged human immunoglobulin light chain variable region nucleotide sequence is selected from the group consisting of Jλ1, Jλ2, Jλ3 and Jλ7. In some certain embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is Vλ2-14Jλ1. In some certain embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises Vλ2-14Jλ2. In some certain embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises Vλ2-14Jλ3. In some certain embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises Vλ2-14Jλ7.

In some embodiments, the non-human animal is a rodent, and wherein the light chain constant region nucleic acid sequence is a rat or a mouse Cλ constant region nucleic acid sequence.

In some embodiments, the non-human animal is a rodent, and wherein the heavy chain constant region nucleic acid sequence is a rat or mouse constant region sequence selected from the group consisting of Igμ, Igδ, Igγ, Igε, Igα, and a combination thereof, each of which encodes a functional C_(H)1 domain.

In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is placed at an endogenous immunoglobulin light chain locus in the genome. In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is present in a germline genome of the non-human animal. In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is placed at an ectopic locus in the genome. In some embodiments, the non-human animal comprises a plurality of copies of the rearranged human immunoglobulin light chain variable region nucleotide sequence.

In some embodiments, the non-human animal comprises an Adam6a gene, an Adam6b gene or both. In some embodiments, a non-human animal comprises a functional ectopic mouse Adam6 gene.

In some embodiments, the nucleic acid sequence encoding the universal light chain comprises one or more histidine codons that are not encoded by a corresponding human germline light chain variable gene segment.

In some embodiments, the present invention provides methods of using a genetically modified non-human animal provided herein or made according to a method disclosed herein, wherein the methods generally comprise isolating from the non-human animal a cell, e.g., a lymphocyte, e.g., a B cell, that expresses a hybrid immunoglobulin chain that comprises a V_(L/CH×ULC) domain fused to a heavy chain constant region, wherein the hybrid immunoglobulin chain is cognate to a universal light chain and/or obtaining from cell a nucleic acid encoding the V_(L/CH×ULC) domain of the hybrid immunoglobulin chain. In some embodiments, a method for obtaining a nucleic acid sequence that encodes an immunoglobulin light chain variable V_(L/CH×ULC) domain comprises (a) optionally immunizing a non-human animal with an antigen that comprises an epitope or immunogenic portion thereof, wherein the non-human animal comprises in its genome (i) a rearranged human immunoglobulin light chain variable region nucleotide sequence operably linked to an immunoglobulin light chain constant region nucleic acid sequence, and (ii) unrearranged human immunoglobulin light chain variable region gene segments (V_(L) and J_(L)) operably linked to an immunoglobulin heavy chain constant region nucleic acid sequence such that the non-human animal mounts an immune response; and isolating from the immunized non-human animal a cell that expresses a nucleic acid sequence that encodes a light chain variable V_(L/CH×ULC) domain that can bind the antigen and/or the nucleic acid sequence that encodes a light chain variable V_(L/CH×ULC) domain that can bind the antigen.

In some embodiments, the nucleic acid sequence that encodes the light chain variable V_(L/CH×ULC) domain that can bind the antigen is derived from the unrearranged human immunoglobulin light chain variable region gene segments (V_(L) and J_(L)) operably linked to a heavy chain constant region nucleic acid sequence.

In some embodiments, the isolating step is carried out via fluorescence-activated cell sorting (FACS) or flow cytometry. In some embodiments, the isolating step comprises obtaining from the immunized non-human animal a cell and obtaining from said cell the nucleic acid sequence that encodes the light chain V_(L/CH×ULC) domain that can bind the antigen, and wherein the cell is a lymphocyte. In some certain embodiments, the lymphocyte comprises natural killer cells, T cells, or B cells.

In some embodiments, the method further comprises fusing the lymphocyte with a cancer cell to form a hybridoma. In some certain embodiments, the cancer cell is a myeloma cell.

In some embodiments, the isolated nucleic acid sequence is fused with a nucleic acid sequence encoding an immunoglobulin constant region nucleic acid sequence.

In some embodiments, the non-human animal is a mammal or a bird. In some certain embodiments, the bird is a chicken. In some certain embodiments, the mammal is a rodent. In some embodiment, the rodent is selected from the group consisting of a mouse, a rat, and a hamster.

In some embodiments, the unrearranged human immunoglobulin light chain variable V_(L) and J_(L) gene segments are operably linked to a non-human immunoglobulin heavy chain constant region nucleic acid sequence. In some embodiments, the non-human immunoglobulin heavy chain constant region nucleic acid sequence is a mouse or rat immunoglobulin heavy chain constant region nucleic acid sequence, e.g., comprising one or more heavy chain constant region genes each one comprising a sequence encoding a functional CH1 domain, e.g., comprising at least an intact Igμ gene and at least one of an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and an intact Igα gene.

In some embodiments, the non-human animal is a rodent, and the unrearranged human immunoglobulin light chain variable V_(L) and J_(L) gene segments are operably linked to a human heavy chain constant region nucleic acid sequence, e.g., comprising one or more heavy chain constant region genes each one comprising a sequence encoding a functional CH1 domain, e.g., comprising at least an intact. Igμ gene and at least one of an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and an intact Igα gene.

In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is operably linked to a non-human light chain constant region nucleic acid sequence. In some embodiments, the non-human light chain constant region nucleic acid sequence is a mouse or a rat light chain constant region nucleic acid sequence.

In some embodiments, the non-human animal is a rodent, and the rearranged human immunoglobulin light chain variable region nucleotide sequence is operably linked to a human light chain constant region nucleic acid sequence. In some certain embodiments, the light chain constant region nucleic acid sequence is a kappa sequence. In some certain embodiments, the light chain constant region nucleic acid sequence is a lambda sequence.

In some embodiments, the unrearranged human immunoglobulin V_(L) and J_(L) gene segments are human Vκ and Jκ gene segments. In some embodiments, the unrearranged human immunoglobulin V_(L) and J_(L) gene segments are human Vλ and Jλ gene segments.

In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises a human κ light chain variable region nucleotide sequence. In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises a human λ light chain variable region nucleotide sequence.

In some embodiments, the unrearranged human immunoglobulin V_(L) gene segments include one or more of Vκ1-5, Vκ1-6, Vκ1-8, Vκ1-9, Vκ1-12, Vκ1-13, Vκ1-16, Vκ1-17, Vκ1-22, Vκ1-27, Vκ1-32, Vκ1-33, Vκ1-35, Vκ1-37, Vκ1-39, Vκ1D-8, Vκ1D-12, Vκ1D-13, Vκ1D-16, Vκ1D-17, Vκ1D-22, Vκ1D-27, Vκ1D-32, Vκ1D-33, Vκ1D-35, Vκ1D-37, Vκ1D-39, Vκ1D-42, Vκ1D-43, Vκ1-NL1, Vκ2-4, Vκ2-10, Vκ2-14, Vκ2-18, Vκ2-19, Vκ2-23, Vκ2-24, Vκ2-26, Vκ2-28, Vκ2-29, Vκ2-30, Vκ2-36, Vκ2-38, Vκ2-40, Vκ2D-10, Vκ2D-14, Vκ2D-18, Vκ2D-19, Vκ2D-23, Vκ2D-24, Vκ2D-26, Vκ2D-28, Vκ2D-29, Vκ2D-30, Vκ2D-36, Vκ2D-38, Vκ2D-40, Vκ3-7, Vκ3-11, Vκ3-15, Vκ3-20, Vκ3-25, Vκ3-31, Vκ3- 34, Vκ3D-7, Vκ3D-7, Vκ3D-11, Vκ3D-15, Vκ3D-15, Vκ3D-20, Vκ3D-25, Vκ3D-31, Vκ3D-34, Vκ3-NL1, Vκ3-NL2, Vκ3-NL3, Vκ3-NL4, Vκ3-NL5, Vκ4-1, Vκ5-2, Vκ6-21, Vκ6D-21, Vκ6D-41, and Vκ7-3. In some embodiments, the unrearranged human immunoglobulin J_(L) gene segments include one or more of Jκ1, Jκ2, Jκ3, Jκ4, and Jκ5.

In some embodiments, the V_(L) gene segment in the rearranged human immunoglobulin light chain variable region is a (human germline) Vκ gene segment selected from the group consisting of Vκ1-5, Vκ1-6, Vκ1-8, Vκ1-9, Vκ1-12, Vκ1-13, Vκ1-16, Vκ1-17, Vκ1-22, Vκ1-27, Vκ1-32, Vκ1-33, Vκ1-35, Vκ1-37, Vκ1-39, Vκ1D-8, Vκ1D-12, Vκ1D- 13, Vκ1D-16, Vκ1D-17, Vκ1D-22, Vκ1D-27, Vκ1D-32, Vκ1D-33, Vκ1D-35, Vκ1D-37, Vκ1D-39, Vκ1D-42, Vκ1D-43, Vκ1-NL1, Vκ2-4, Vκ2-10, Vκ2-14, Vκ2-18, Vκ2-19, Vκ2-23, Vκ2-24, Vκ2-26, Vκ2-28, Vκ2-29, Vκ2-30, Vκ2-36, Vκ2-38, Vκ2-40, Vκ2D-10, Vκ2D- 14, Vκ2D-18, Vκ2D-19, Vκ2D-23, Vκ2D-24, Vκ2D-26, Vκ2D-28, Vκ2D-29, Vκ2D-30, Vκ2D-36, Vκ2D-38, Vκ2D-40, Vκ3-7, Vκ3-11, Vκ3-15, Vκ3-20, Vκ3-25, Vκ3-31, Vκ3-34, Vκ3D-7, Vκ3D-7, Vκ3D-11, Vκ3D-15, Vκ3D-15, Vκ3D-20, Vκ3D-25, Vκ3D-31, Vκ3D-34, Vκ3-NL1, Vκ3-NL2, Vκ3-NL3, Vκ3-NL4, Vκ3-NL5, Vκ4-1, Vκ5-2, Vκ6-21, Vκ6D-21, Vκ6D-41, and Vκ7-3. In some certain embodiments, the V_(L) gene segment is selected from the group consisting of Vκ1-39 (e.g., a human germline Vκ1-39 gene segment) and Vκ3-20 (e.g., a human germline Vκ3-20 gene segment). In some embodiments, the J_(L) gene segment is selected from the group consisting of Jκ1 (e.g., a human germline Jκ1 gene segment), Jκ2 (e.g., a human germline Jκ2 gene segment), Jκ3 (e.g., a human germline Jκ3 gene segment), Jκ4 (e.g., a human germline Jκ4 gene segment), and Jκ5 (e.g., a human germline Jκ5 gene segment). In some certain embodiments, the rearranged human immunoglobulin light chain variable nucleotide sequence comprises Vκ1-39 and Jκ5 (e.g., the Vκ1-39 is rearranged with the Jκ5, e.g., a human germline Vκ1-38 gene segment is rearranged with a human germline Jκ5 gene segment). In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises a sequence set forth as SEQ ID NO:1. In some certain embodiments, the rearranged human immunoglobulin light chain variable nucleotide sequence comprises Vκ3-20 and Jκ1 (e.g., the Vκ3-20 is rearranged with the Jκ1, e.g., a human germline Vκ3-20 gene segment rearranged with a human germline Jκ1 gene segment). In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises a sequence set forth as SEQ ID NO:2. In some embodiments, the non-human animal is a rodent, and wherein the light chain constant region nucleic acid sequence is a rat or a mouse OK constant region nucleic acid sequence.

In some embodiments, the non-human animal is a rodent, and wherein the heavy chain constant region nucleic sequence is a rat or mouse constant region sequence selected from the group consisting of Igμ, Igδ, Igγ, Igε, Igα, and a combination thereof, each of which encodes a functional C_(H)1 domain.

In some embodiments, the unrearranged human immunoglobulin V_(L) gene segments include one or more of Vλ3-1, Vλ4-3, Vλ2-8, Vλ3-9, Vλ3-10, Vλ2-11, Vλ3-12, Vλ2-14, Vλ3-16, Vλ2-18, Vλ3-19, Vλ3-21, Vλ3-22, Vλ2-23, Vλ3-25, Vλ3-27, Vλ1-36, Vλ5-37, Vλ5-39, Vλ1-40, Vλ7-43, Vλ1-44, Vλ5-45, Vλ7-46, Vλ1-47, Vλ9-49, Vλ1-51, Vλ5-52, Vλ6-57, Vλ4-60, Vλ8-61 and Vλ4-69. In some embodiments, the non-human animal comprises unrearranged human immunoglobulin J_(L) gene segments that include Jλ1, Jλ2, Jλ3 and Jλ7.

In some embodiments, the V_(L) gene segment in the rearranged human immunoglobulin light chain variable region nucleotide sequence is a (human germline) Vλ gene segment selected from the group consisting of Vλ3-1, Vλ4-3, Vλ3-9, Vλ3-10, Vλ2-11, Vλ3-12, Vλ2-14, Vλ3-16, Vλ2-18, Vλ3-19, Vλ3-21, Vλ3-22, Vλ2-23, Vλ3-25, Vλ3-27, Vλ1-36, Vλ5-37, Vλ5-39, Vλ1-40, Vλ7-43, Vλ1-44, Vλ5-45, Vλ7-46, Vλ9-49, Vλ1-51, Vλ5-52, Vλ6-57, Vλ4-60, Vλ8-61 and Vλ4-69. In some certain embodiments the V_(L) gene segment is Vλ2-14, e.g., a human germline Vλ2-14 gene segment. In some embodiments, the J_(L) gene segment in the rearranged human immunoglobulin light chain variable region nucleotide sequence is selected from the group consisting of Jλ1, Jλ2, Jλ3 and Jλ7. In some certain embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises Vλ2-14Jλ1. In some certain embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises Vλ2-14Jλ2. In some certain embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises Vλ2-14Jλ3. In some certain embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises Vλ2-14Jλ7. In some embodiments, the non-human animal is a rodent and the light chain constant region nucleic acid sequence is a rat or a mouse Cλ constant region nucleic acid sequence.

In some embodiments, the non-human animal is a rodent, and wherein the heavy chain constant region nucleic acid sequence is a rat or mouse constant region sequence selected from the group consisting of Igμ, Igδ, Igγ, Igε, Igα, and a combination thereof, each of which encodes a functional C_(H)1 domain.

In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is at an endogenous immunoglobulin light chain locus in the genome. In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is present in a germline genome of the non-human animal. In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is integrated into a transcriptionally active locus in the genome. In some embodiments, the non-human animal comprises a plurality of copies of the rearranged human immunoglobulin light chain variable region nucleotide sequence.

In some embodiments, the non-human animal comprises an integrated Adam6a gene, an Adam6b gene or both. In some embodiments, a non-human animal comprises a functional ectopic mouse Adam6 gene.

In some embodiments, the nucleic acid sequence encoding the universal light chain comprises one or more histidine codons that are not encoded by a corresponding human germline light chain variable gene segment.

In some embodiments, the present invention provides a method for making an antigen-binding protein that comprises a V_(L/CH×ULC) (V_(κOH×ULC) or V_(λOH×ULC)) domain, the method generally comprising expressing in a host cell a first nucleic acid comprising a nucleic acid sequence that encodes a V_(L/CH×ULC) domain, optionally operably linked with a heavy chain constant region gene comprising a functional C_(H)1 domain encoding sequence or a light chain constant region gene, wherein the V_(L/CH×ULC) domain is cognate to a universal light chain variable domain, and wherein the antigen-binding protein is not a single domain antigen binding protein. In some embodiments, the nucleic acid sequence that encodes the V_(L/CH×ULC) domain is isolated from non-human animal comprising in its genome (i) a rearranged human immunoglobulin light chain variable region nucleotide sequence operably linked to an immunoglobulin light chain constant region nucleic acid sequence; and (ii) unrearranged human immunoglobulin light chain variable region gene segments (V_(L) and J_(L)) operably linked to an immunoglobulin heavy chain constant region nucleic acid sequence, wherein the nucleic acid sequence that encodes the V_(L/CH×ULC) domain is derived from the unrearranged human immunoglobulin light chain variable region gene segments (V_(L) and J_(L)) operably linked to an immunoglobulin heavy chain constant region nucleic acid sequence, e.g., comprising one or more heavy chain constant region genes each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising at least an intact Igμ gene and at least one of an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and an intact Igα gene. In some embodiments, the method further comprises (a) optionally immunizing a non-human animal with an antigen that comprises an epitope or immunogenic portion thereof, wherein the non-human animal comprises in its genome (i) a rearranged human immunoglobulin light chain variable region nucleotide sequence operably linked to an immunoglobulin light chain constant region nucleic acid sequence; and (ii) unrearranged human immunoglobulin light chain variable region gene segments (V_(L) and J_(L)) capable of rearranging to form a rearranged V_(L)/J_(L) gene sequence operably linked to an immunoglobulin heavy chain constant region nucleic acid sequence, such that the non-human animal mounts an immune response to the epitope or immunogenic portion thereof prior to (b) isolating from the non-human animal a nucleic acid sequence that encodes a light chain variable domain that specifically binds the epitope or immunogenic portion thereof and is derived from the rearranged V_(L)/J_(L) gene sequence, which is operably linked to an immunoglobulin heavy chain constant region nucleic acid. Additional embodiments include methods comprising (c) employing the isolated nucleic acid sequence in an expression construct optionally operably linked to a human immunoglobulin constant region nucleic acid sequence; prior to (d) expressing the nucleic acid sequence or expression construct comprising same in a production cell line, e.g., a host cell, to obtain an antigen-binding protein.

In some embodiments, the method for making an antigen-binding protein that comprises a V_(L/CH×ULC) domain comprises co-expressing in a host cell (i) a first nucleic acid comprising a nucleic sequence that encodes a first binding component comprising a first variable domain, e.g., a V_(L/CH×ULC) domain, specific for a first epitope, optionally operably linked with a first heavy chain constant region gene comprising a functional C_(H)1 domain encoding sequence or a first light chain constant region gene, and (ii) a second nucleic acid encoding a second component comprising a second variable domain specific for a second epitope, wherein the second variable domain is either a second V_(L/CH×ULC) domain or a V_(H×ULC) domain, wherein the first and second epitopes are not identical, and wherein the first and second variable domains are each cognate to universal light chain variable domains that are derived from the same single rearranged light chain variable region gene sequence, and thus, are identical or are somatically hypermutated variants, e.g., differ in amino acid sequence only through somatic hypermutation.

Thus, in some embodiments, the method comprises (a) immunizing a second non human animal with an antigen that comprises an epitope or immunogenic portion thereof, wherein the second non-human animal comprises in its genome (i) a rearranged human immunoglobulin light chain variable region nucleotide sequence operably linked to an immunoglobulin light chain constant region nucleic acid sequence; and (ii) either unrearranged human immunoglobulin light chain variable region gene segments (V_(L) and J_(L)) capable of rearranging to form a rearranged V_(L)/J_(L) gene sequence (that encodes the second V_(L/CH×ULC) domain of the second binding component) or unrearranged human immunoglobulin heavy chain variable region gene segments (V_(H), D and J_(H)) capable of rearranging to form a rearranged V_(H)/D/J_(H) gene sequence (that encodes the V_(H×ULC) domain of the second binding component) operably linked to an immunoglobulin heavy chain constant region nucleic acid sequence such that the non-human animal mounts an immune response to the epitope or immunogenic portion thereof prior to (b) isolating from the non-human animal a second nucleic acid sequence that encodes the second V_(L/CH×ULC) or V_(H×ULC) domain that specifically binds the second epitope or immunogenic portion thereof. Additional embodiments include methods comprising (c) employing the isolated second nucleic acid sequence in an expression construct, optionally operably linked to a human immunoglobulin constant region nucleic acid sequence; prior to (d) expressing of the first and second nucleic acid sequences or expression construct(s) comprising same in a production cell line, e.g., a host cell, to obtain an antigen-binding protein, wherein the antigen binding protein is not a single domain antigen binding protein.

In additional embodiments, the methods further comprise co-expressing in the production host cell the first nucleic acid encoding a first V_(L/CH×ULC) domain (or expression construct comprising same), optionally the second nucleic acid encoding a second V_(L/CH×ULC) domain or V_(H×ULC) domain (or expression construct comprising same) and a nucleotide sequence comprising a rearranged V_(L)/J_(L) gene sequence that encodes a human universal light chain variable domain, or somatically hypermutated variant thereof, that is cognate to the V_(L/CH×ULC) domain and the optional second V_(L/CH×ULC) domain or V_(H×ULC) domain. In some embodiments, the nucleotide sequence encodes the universal light chain variable domain fused to a human light chain constant domain.

The first nucleic acid sequence, and either or both the second nucleic acid sequence and the nucleotide sequence comprising a rearranged V_(L)/J_(L) gene sequence encoding the human universal light chain variable domain, may be employed in the same or different expression constructs, wherein the one or more expression constructs express the antigen binding protein, e.g., the first binding component, and either or both the second binding component and universal light chain, in a format selected from the group consisting of a Fab-like structure, an scFab-like structure, a diabody like structure, an scFv-like structure, an scFv-Fc like structure, an scFv-zipper like structure, and a tetrameric structure that is similar to a typical antibody and that includes the cognate universal light chain. Accordingly, in some embodiments, either or both the first and second nucleic acid sequences may respectively encode the first variable V_(L/CH×ULC) and second variable (V_(L/CH×ULC) or V_(H×ULC)) domain fused or not fused to a constant region, e.g., a (human) heavy chain constant region comprising a functional C_(H)1 domain or a (human) light chain constant domain.

In some embodiments, either or both first and second nucleic acid sequences comprise a heavy chain constant region nucleic acid that encodes a human heavy chain constant region having an isotype selected from the group consisting of IgM, IgD, IgG, IgE and IgA, e.g., an IgG heavy chain constant region having a subclass selected from the group consisting of IgG1, IgG2, IgG3, and IgG4. In some embodiments, the first and second nucleic acid sequence encode the first variable V_(L/CH×ULC) and the second variable (V_(L/CH×ULC) or V_(H×ULC)) domain fused to heavy chain constant regions having an identical isotype and/or subclass, but optionally, wherein the heavy chain constant regions differ in their affinity to Protein A. In some embodiments, wherein both the first variable V_(L/CH×ULC) and the second variable (V_(L/CH×ULC) or V_(H×ULC)) domains are fused to a human IgG1, IgG2, or IgG4 heavy chain constant region, only one of the first variable V_(L/CH×ULC) and the second variable (V_(L/CH×ULC) or V_(H×ULC)) domain is fused to a human IgG1, IgG2, or IgG4 heavy chain constant region comprising a mutation in the CH3 domain that reduces or eliminates binding of the CH3 domain to Protein A, e.g., a mutation selected from the group consisting of (a) 95R, and (b) 95R and 96F in the IMGT numbering system, or (a′) 435R, and (b′) 435R and 436F in the EU numbering system. In some embodiments, the human heavy chain constant region is a human IgG1 constant region and further comprises one to five modifications selected from the group consisting of 16E, 18M, 44S, 52N, 57M, and 82I in the IMGT exon numbering system, or 356E, 358M, 384S, 392N, 397M, and 422I in the EU numbering system. In some embodiments, the human heavy chain constant region is a human IgG2 constant region and further comprises one or two modifications selected from the group consisting of 44S, 52N, 82I in the IMGT exon numbering system, or 348S, 392N and 422I in the EU numbering system. In other embodiments, the human heavy chain constant region is a human IgG4 constant region and further comprises one to seven modifications selected from the group consisting of 15R, 44S, 52N, 57M, 69K, 79Q and 82I in the IMGT exon numbering system or 355R, 384S, 392N, 397M, 409K, 419Q and 422I in the EU numbering system and/or the modification 105P in the IGMT exon numbering system or 445P in the EU numbering system.

In some embodiments, at least one of the unrearranged human immunoglobulin light chain V_(L) or J_(L) gene segments encode one or more histidine residues that are not encoded by a corresponding human germline light chain variable gene segment.

In some embodiments, the first and/or second non-human animal from which the first and second nucleic acid sequences are derived is a mammal or a bird. In some certain embodiments, the bird is a chicken. In some certain embodiments, the mammal is a rodent. In some embodiments, the rodent is selected from the group consisting of a mouse, a rat, and a hamster.

In some embodiments, the human immunoglobulin light chain variable V_(L) and J_(L) gene segments are operably linked to a non-human immunoglobulin heavy chain constant region nucleic acid sequence, e.g., comprising one or more heavy chain constant region genes each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising at least an intact Igμ gene and at least one of an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and an intact Igα gene. In some embodiments, the non-human immunoglobulin heavy chain constant region nucleic acid sequence is a mouse or rat immunoglobulin heavy chain constant region nucleic acid sequence, e.g., comprising one or more heavy chain constant region genes each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising at least an intact Igμ gene and at least one of an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and an intact Igα gene. In some embodiments, the non-human animal is a rodent, and the human immunoglobulin light chain variable V_(L) and J_(L) gene segments are operably linked to a human heavy chain constant region nucleic acid sequence, e.g., comprising one or more heavy chain constant region genes each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising at least an intact Igγ, gene and at least one of an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and an intact Igα gene.

In some embodiments, the heavy chain constant region nucleic acid sequence comprises a nucleotide sequence that encodes a C_(H)1, a hinge, a C_(H)2, a C_(H)3, or a combination thereof. In some embodiments, heavy chain constant region nucleic acid sequence comprises a sequence that encodes a functional C_(H)1 domain.

In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is operably linked to a non-human light chain constant region nucleic acid sequence. In some embodiments, the non-human light chain constant region nucleic acid sequence is a mouse or a rat light chain constant region nucleic acid sequence. In some embodiments, the non-human animal is a rodent, and the rearranged human immunoglobulin light chain variable region nucleotide sequence is operably linked to a human light chain constant region nucleic acid sequence. In some certain embodiments, the light chain constant region nucleic acid sequence is a kappa sequence. In some certain embodiments, the light chain constant region nucleic acid sequence is a lambda sequence.

In some embodiments, the unrearranged human immunoglobulin light chain variable V_(L) and J_(L) gene segments are Vκ and Jκ gene segments. In some embodiments, the unrearranged human immunoglobulin light chain variable V_(L) and J_(L) gene segments are Vλ and Jλ gene segments.

In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises a human κ light chain variable region nucleotide sequence. In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises a human λ light chain variable domain gene sequence.

In some embodiments, the unrearranged human immunoglobulin V_(L) gene segments include one or more of Vκ1-5, Vκ1-6, Vκ1-8, Vκ1-9, Vκ1-12, Vκ1-13, Vκ1-16, Vκ1-17, Vκ1-22, Vκ1-27, Vκ1-32, Vκ1-33, Vκ1-35, Vκ1-37, Vκ1-39, Vκ1D-8, Vκ1D-12, Vκ1D-13, Vκ1D-16, Vκ1D-17, Vκ1D-22, Vκ1D-27, Vκ1D-32, Vκ1D-33, Vκ1D-35, Vκ1D-37, Vκ1D-39, Vκ1D-42, Vκ1D-43, Vκ1-NL1, Vκ2-4, Vκ2-10, Vκ2-14, Vκ2-18, Vκ2-19, Vκ2-23, Vκ2-24, Vκ2-26, Vκ2-28, Vκ2-29, Vκ2-30, Vκ2-36, Vκ2-38, Vκ2-40, Vκ2D-10, Vκ2D-14, Vκ2D-18, Vκ2D-19, Vκ2D-23, Vκ2D-24, Vκ2D-26, Vκ2D-28, Vκ2D-29, Vκ2D-30, Vκ2D-36, Vκ2D-38, Vκ2D-40, Vκ3-7, Vκ3-11, Vκ3-15, Vκ3-20, Vκ3-25, Vκ3-31, Vκ3- 34, Vκ3D-7, Vκ3D-7, Vκ3D-11, Vκ3D-15, Vκ3D-15, Vκ3D-20, Vκ3D-25, Vκ3D-31, Vκ3D-34, Vκ3-NL1, Vκ3-NL2, Vκ3-NL3, Vκ3-NL4, Vκ3-NL5, W4-1, W5-2, Vκ6-21, W6D-21, W6D-41, and Vκ7-3. In some embodiments, the unrearranged human immunoglobulin J_(L) gene segments include Jκ1, Jκ2, Jκ3, Jκ4, and Jκ5.

In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises V_(L) and J_(L) gene segments. In some embodiments, the V_(L) gene segment in the rearranged human immunoglobulin light chain variable region is a (human germline) W gene segment selected from the group consisting of Vκ1-5, Vκ1-6, Vκ1-8, WI-9, Vκ1-12, Vκ1-13, Vκ1-16, Vκ1-17, Vκ1-22, Vκ1-27, Vκ1- 32, Vκ1-33, Vκ1-35, Vκ1-37, Vκ1-39, Vκ1D-8, Vκ1D-12, Vκ1D-13, Vκ3D-16, Vκ1D-17, Vκ1D-22, Vκ1D-27, Vκ1D-32, Vκ1D-35, Vκ1D-37, Vκ1D-39, Vκ1D-43, Via-NIA, Vκ2-4, Vκ2-10, Vκ2-14, Vκ2-18, Vκ2-19, Vκ2-23, Vκ2-24, Vκ2-26, Vκ2-28, Vκ2-29, Vκ2-30, Vκ2-36, Vκ2-38, Vκ2-40, Vκ2D-10, Vκ2D-14, Vκ2D-18, Vκ2D-19, Vκ2D-23, Vκ2D-24, Vκ2D-26, Vκ2D-28, Vκ2D-29, Vκ2D-30, Vκ2D-36, Vκ2D-38, Vκ2D-40, Vκ3-7, Vκ3-11, Vκ3-15, Vκ3-20, Vκ3-25, Vκ3-31, Vκ3-34, Vκ3D-7, Vκ3D-7, Vκ3D-11, Vκ3D-15, Vκ3D-15, Vκ3D-20, Vκ3D-25, Vκ3D-31, Vκ3D-34, Vκ3-NL2, Vκ3-NL3, Vκ3-NL4, Vκ3-NL5, Vκ4-1, Vκ5-2, Vκ6-21, Vκ6D-21, W6D-41, and Vκ7-3. In some certain embodiments, the V_(L) gene segment is selected from the group consisting of Vκ1-39 (e.g., a human germline Vκ1-39 gene segment) and Vκ3-20 (e.g., a human germline Vκ3-20 gene segment). In some embodiments, the J_(L) gene segment is selected from the group consisting of Jκ1, Jκ2, Jκ3, Jκ4, and Jκ5. In some certain embodiments, the rearranged human immunoglobulin light chain variable nucleotide sequence comprises Vκ1-39 and Jκ5 (e.g., the Vκ1-39 is rearranged with the Jκ5, e.g., a human germline Vκ1-39 gene segment is rearranged with a human germline Jκ5 gene segment). In some embodiments, the rearranged human immunoglobulin light chain variable region gene sequence comprises a sequence set forth as SEQ ID NO:1. In some certain embodiments, the rearranged human immunoglobulin light chain variable nucleotide sequence comprises Vκ3-20 and Jκ1 (e.g., the Vκ3-20 is rearranged with the Jκ1, e.g., a human germline Vκ3-20 gene segment is rearranged with a human germline Jκ1 gene segment). In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence comprises a sequence set forth as SEQ ID NO:2.

In some embodiments, the non-human animal is a rodent, and wherein the light chain constant region nucleic acid sequence is a rat or a mouse Cκ constant region nucleic acid sequence. In some embodiments, the non-human animal is a rodent, and wherein the heavy chain constant region nucleic sequence is a rat or mouse constant region sequence selected from the group consisting of Igμ, Igδ, Igγ, Igε, Igα, and a combination thereof, each of which encodes at least a functional C_(H)1 domain.

In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is at an endogenous immunoglobulin light chain locus in the genome. In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is present in a germline genome of the non-human animal. In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence is at a transcriptionally active locus in the genome.

In some embodiments, the non-human animal comprises a plurality of copies of the rearranged human immunoglobulin light chain variable region nucleotide sequence.

In some embodiments, the non-human animal comprises an Adam6a gene, an Adam6b gene or both. In some embodiments, a non-human animal comprises a functional ectopic mouse Adam6 gene.

In some embodiments, the present invention provides a non-human animal whose genome comprises (a) a first immunoglobulin locus comprising unrearranged human immunoglobulin light chain variable V_(L) and J_(L) gene segments operably linked to an immunoglobulin heavy chain constant region nucleic acid sequence; and (b) a second immunoglobulin locus comprising a rearranged non-human immunoglobulin light chain variable region nucleotide sequence operably linked to an immunoglobulin light chain constant region nucleic acid sequence.

In some embodiments, the non-human animal is a mammal or a bird. In some certain embodiments, the bird is a chicken. In some certain embodiments, the mammal is a rodent. In some embodiments, the rodent is selected from the group consisting of a mouse, a rat, and a hamster.

In some embodiments, the rearranged non-human immunoglobulin light chain variable region nucleotide sequence comprises rodent immunoglobulin Vκ and Jκ gene segments. In some certain embodiments, the rodent immunoglobulin Vκ and Jκ gene segments are mouse gene segments. In some certain embodiments, the rodent immunoglobulin Vκ and Jκ gene segments are rat gene segments.

In some embodiments, the present invention provides a method of making a non-human animal, the method comprising (a) modifying a genome of a non-human animal to delete or render non-functional all or substantially all (i) endogenous functional immunoglobulin heavy chain V_(H), D, and J_(H) gene segments and (ii) endogenous functional light chain V_(L) and J_(L) gene segments; (b) placing unrearranged human immunoglobulin light chain variable V_(L) and J_(L) gene segments in the genome so that the unrearranged light chain variable gene segments are operably linked to a heavy chain constant region nucleic acid sequence; and (c) placing a rearranged non-human immunoglobulin light chain variable region nucleotide sequence in the genome so that the rearranged human immunoglobulin light chain variable region nucleotide sequence is operably linked to a light chain constant region nucleic acid sequence.

In some embodiments, the present invention provides a method for obtaining a nucleic acid sequence that encodes an immunoglobulin light chain variable domain (V_(L)) capable of binding an antigen independently from a heavy chain variable domain, the method comprising (a) immunizing a non-human animal with an antigen that comprises an epitope or immunogenic portion thereof, wherein the non-human animal comprises in its genome (i) a rearranged non-human immunoglobulin light chain variable region nucleotide sequence operably linked to an immunoglobulin light chain constant region nucleic acid sequence, and (ii) unrearranged human immunoglobulin light chain variable region gene segments (V_(L) and J_(L)) operably linked to an immunoglobulin heavy chain constant region nucleic acid sequence; (b) allowing the non-human animal to mount an immune response; and (c) obtaining from the immunized non-human animal a nucleic acid sequence that encodes the light chain variable domain (V_(L) domain) that can bind the antigen.

In some embodiments, the present invention provides a method for making an antigen-binding protein that comprises an immunoglobulin light chain variable domain that can bind an antigen independently from a heavy chain variable domain, the method comprising (a) immunizing a non-human animal with an antigen that comprises an epitope or immunogenic portion thereof, wherein the non-human animal comprises in its genome (i) a rearranged non-human immunoglobulin light chain variable region nucleotide sequence operably linked to an immunoglobulin light chain constant region nucleic acid sequence; and (ii) unrearranged human immunoglobulin light chain variable region gene segments (V_(L) and J_(L)) operably linked to an immunoglobulin heavy chain constant region nucleic acid sequence; (b) allowing the non-human animal to mount an immune response to the first epitope or immunogenic portion thereof; (c) obtaining from the non-human animal a nucleic acid sequence that encodes the light chain variable domain that specifically binds the epitope or immunogenic portion thereof; (d) employing the nucleic acid sequence of (c) in an expression construct, fused to a human immunoglobulin constant region nucleic acid sequence; and (e) expressing the nucleic acid sequence of (c) in a production cell line to form an antigen-binding protein whose light chain is encoded by the nucleic acid of (c) and that binds the epitope or immunogenic portion thereof independently from a heavy chain.

In some embodiments, the present invention provides a non-human animal that comprises in its germline genome (a) a hybrid immunoglobulin chain locus comprising unrearranged human immunoglobulin light chain variable V_(L) and J_(L) gene segments operably linked to a heavy chain constant region nucleic acid sequence; and (b) an immunoglobulin light chain locus comprising two or more but less than the wild type number of human immunoglobulin light chain variable region gene segments operably linked to an immunoglobulin light chain constant region nucleic acid sequence.

In some embodiments, the non-human animal is a mammal or a bird. In some certain embodiments, the bird is a chicken. In some certain embodiments, the mammal is a rodent. In some embodiments, the rodent is selected from the group consisting of a mouse, a rat, and a hamster.

In some embodiments, the non-human animal is homozygous for the two or more but less than the wild type number of human immunoglobulin light chain variable region gene segments operably linked to an immunoglobulin light chain constant region nucleic acid sequence.

In some embodiments, the unrearranged human immunoglobulin light chain variable gene segments are operably linked to a non-human heavy chain constant region nucleic acid sequence. In some certain embodiments, the non-human heavy chain constant region nucleic acid sequence is a mouse or a rat heavy chain constant region nucleic acid sequence. In some embodiments, the non-human animal is a rodent, and the unrearranged human immunoglobulin light chain variable gene segments are operably linked to a human heavy chain constant region nucleic acid sequence. In some embodiments, the heavy chain constant region nucleic acid sequence comprises a nucleotide sequence that encodes a C_(H)1, a hinge, a C_(H)2, a C_(H)3, or a combination thereof.

In some embodiments, the two or more but less than the wild type number of human immunoglobulin light chain variable region gene segments are operably linked to a non-human light chain constant region nucleic acid sequence. In some embodiments, the non-human light chain constant region nucleic acid sequence is a mouse or a rat light chain constant region nucleic acid sequence.

In some embodiments, the non-human animal is a rodent and the two or more but less than the wild type number of human immunoglobulin light chain variable region gene segments are operably linked to a human light chain constant region nucleic acid sequence. In some embodiments, the light chain constant region nucleic acid sequence is a kappa sequence. In some embodiments, the light chain constant region nucleic acid sequence is a lambda sequence. In some embodiments, the immunoglobulin light chain locus is a kappa locus. In some embodiments, the immunoglobulin light chain locus is a lambda locus.

In some embodiments, the unrearranged human immunoglobulin V_(L) and J_(L) gene segments are human Vκ and Jκ gene segments. In some embodiments, the unrearranged human immunoglobulin V_(L) and J_(L) gene segments are human Vλ and Jλ gene segments.

In some embodiments, the two or more but less than the wild type number of human immunoglobulin light chain variable region gene segments comprise a human κ light chain variable region nucleotide sequence. In some embodiments, the two or more but less than the wild type number of human immunoglobulin light chain variable region gene segments comprise a human λ light chain variable region nucleotide sequence.

In some embodiments, the unrearranged human immunoglobulin light chain variable V_(L) gene segment is selected from the group consisting of Vκ1-5, Vκ1-8, Vκ1-9, Vκ1-12, Vκ1-13, Vκ1-16, Vκ1-17, Vκ1-22, Vκ1-27, Vκ1-32, Vκ1-33, Vκ1-35, Vκ1-37, Vκ1-39, Vκ1D-8, Vκ1D-12, Vκ1D-13, Vκ1D-16, Vκ1D-17, Vκ1D-22, Vκ1D-27, Vκ1D-32, Vκ1D-33, Vκ1D-35, Vκ1D-37, Vκ1D-39, Vκ1D-42, Vκ1D-43, Vκ1-NL1, Vκ2-4, Vκ2-10, Vκ2-14, Vκ2-18, Vκ2-19, Vκ2-23, Vκ2-24, Vκ2-26, Vκ2-28, Vκ2-29, Vκ2-30, Vκ2-36, Vκ2-38, Vκ2-40, Vκ2D-10, Vκ2D-14, Vκ2D-18, Vκ2D-19, Vκ2D-23, Vκ2D-24, Vκ2D-26, Vκ2D-28, Vκ2D-29, Vκ2D-30, Vκ2D-36, Vκ2D-38, Vκ2D-40, Vκ3-7, Vκ3-11, Vκ3-15, Vκ3-20, Vκ3-25, Vκ3-31, Vκ3-34, Vκ3D-7, Vκ3D-11, Vκ3D-15, Vκ3D-15, Vκ3D-20, Vκ3D-25, Vκ3D-31, Vκ3D-34, Vκ3-NL1, Vκ3-NL2, Vκ3-NL3, Vκ3-NL4, Vκ3-NL5, Vκ4-1, Vκ5-2, Vκ6-21, Vκ6D-21, Vκ6D-41, and Vκ7-3. In some embodiments, the unrearranged human immunoglobulin light chain variable J_(L) gene segment is selected from the group consisting of Jκ1, Jκ2, Jκ3, Jκ4, and Jκ5.

In some embodiments, the two or more but less than the wild type number of human immunoglobulin light chain variable region gene segments comprises V_(L) and J_(L) gene segments.

In some embodiments, the V_(L) gene segments of the two or more variable region gene segments are selected from the group consisting of Vκ1-5, Vκ1-6, Vκ1-8, Vκ1-9, Vκ1-12, Vκ1-13, Vκ1-16, Vκ1-17, Vκ1-22, Vκ1-27, Vκ1-32, Vκ1-33, Vκ1-35, Vκ1-37, Vκ1-39, Vκ1D-8, Vκ1D-12, Vκ1D-13, Vκ1D-16, Vκ1D-17, Vκ1D-22, Vκ1D-27, Vκ1D-32, Vκ1D-33, Vκ1D-35, Vκ1D-37, Vκ1D-39, Vκ1D-42, Vκ1D-43, Vκ1-NL1, Vκ2-4, Vκ2-10, Vκ2-14, Vκ2-18, Vκ2-19, Vκ2-23, Vκ2-24, Vκ2-26, Vκ2-28, Vκ2-29, Vκ2-30, Vκ2-36, Vκ2-38, Vκ2-40, Vκ2D-10, Vκ2D-14, Vκ2D-18, Vκ2D-19, Vκ2D-23, Vκ2D-24, Vκ2D-26, Vκ2D-28, Vκ2D-29, Vκ2D-30, Vκ2D-36, Vκ2D-38, Vλ2D-40, Vκ3-7, Vκ3-11, Vκ3-15, Vκ3-20, Vκ3-25, Vκ3-31, Vκ3-34, Vκ3D-7, Vκ3D-7, Vκ3D-11, Vκ3D-15, Vκ3D-15, Vκ3D-20, Vκ3D-25, Vκ3D-31, Vκ3D-34, Vκ3-NL1, Vκ3-NL2, Vκ3-NL3, Vκ3-NL4, Vκ3-NL5, Vκ4-1, Vκ5-2, Vκ6-21, Vκ6D-21, Vκ6D-41, and Vκ7-3. In some certain embodiments, the V_(L) gene segments are selected from the group consisting of Vκ1-39, Vκ3-20, and a combination thereof. In some embodiments, the J_(L) gene segment is selected from the group consisting of Jκ1, Jκ2, Jκ3, Jκ4, Jκ5, and a combination thereof.

In some embodiments, the two or more but less than the wild type number of human immunoglobulin light chain variable region gene segments comprise two or more but less than wild type number of human V_(L) gene segments and one or more human J_(L) gene segments. In some certain embodiments, the two or more but less than the wild type number of V_(L) gene segments comprises Vκ1-39 and Vκ3-20 gene segments and one or more J_(L) gene segments comprises Jκ1, Jκ2, Jκ3, Jκ4, Jκ5, or a combination thereof.

In some embodiments, the non-human animal is a rodent, and wherein the light chain constant region nucleic acid sequence is a rat or a mouse Cκ constant region nucleic acid sequence.

In some embodiments, the non-human animal is a rodent, and wherein the heavy chain constant region nucleic acid sequence is a rat or mouse constant region sequence selected from the group consisting of Igμ, Igδ, Igγ, Igε, Igα, and a combination thereof, each of which encodes a functional C_(H)1 domain.

In some embodiments, substantially all endogenous variable heavy chain V_(H), D, and J_(H) gene segments are deleted from the immunoglobulin heavy chain locus of the non-human animal or rendered non-functional.

In some embodiments, substantially all endogenous light chain V_(L) and J_(L) gene segments are deleted from the immunoglobulin light chain locus of the non-human animal or rendered non-functional.

In some embodiments, the non-human animal comprises an Adam6a gene, an Adam6b gene, or both. In some embodiments, a non-human animal comprises a functional ectopic mouse Adam6 gene.

In some embodiments, the immunoglobulin light chain locus comprises a plurality of copies of the two or more but less than the wild type number of human immunoglobulin light chain variable region gene segments (V_(L) and J_(L)).

In some embodiments, the present invention provides a method of making a non-human animal, the method comprising (a) modifying a genome of a non-human animal to delete or render non-functional all or substantially all (i) endogenous immunoglobulin heavy chain V_(H), D, and J_(H) gene segments and (ii) endogenous light chain V_(L) and J_(L) gene segments; (b) placing unrearranged human immunoglobulin light chain variable V_(L) and J_(L) gene segments in the genome such that the unrearranged light chain variable gene segments are operably linked to a heavy chain constant region nucleic acid sequence; and (c) placing two or more but less than the wild type number of human immunoglobulin light chain variable region gene segments in the genome such that the human immunoglobulin light chain variable region gene segments are operably linked to a light chain constant region nucleic acid sequence.

In some embodiments, the non-human animal is a mammal or a bird. In some certain embodiments, the bird is a chicken. In some certain embodiments, the mammal is a rodent. In some embodiments, the rodent is selected from the group consisting of a mouse, a rat, and a hamster.

In some embodiments, the unrearranged human immunoglobulin light chain variable V_(L) and J_(L) gene segments are operably linked to a non-human immunoglobulin heavy chain constant region nucleic acid sequence. In some certain embodiments, the non-human immunoglobulin heavy chain constant region nucleic acid sequence is a mouse or rat immunoglobulin heavy chain constant region nucleic acid sequence.

In some embodiments, the non-human animal is a rodent, and the unrearranged human immunoglobulin light chain variable V_(L) and J_(L) gene segments are operably linked to a human heavy chain constant region nucleic acid sequence. In some embodiments, the heavy chain constant region nucleic acid sequence comprises a nucleotide sequence that encodes a C_(H)1, a hinge, a C_(H)2, a C_(H)3, or a combination thereof. In some embodiments, the heavy chain constant region nucleic acid sequence comprises a nucleotide sequence that encodes a functional C_(H)1 domain.

In some embodiments, the two or more but less than the wild type number of human immunoglobulin light chain variable region gene segments are operably linked to a non-human light chain constant region nucleic acid sequence. In some embodiments, the non-human light chain constant region nucleic acid sequence is a mouse or a rat light chain constant region nucleic acid sequence.

In some embodiments, the non-human animal is a rodent and the two or more but less than the wild type number of human immunoglobulin light chain variable region gene segments are operably linked to a human light chain constant region nucleic acid sequence. In some embodiments, the light chain constant region nucleic acid sequence is a kappa sequence. In some embodiments, the light chain constant region nucleic acid sequence is a lambda sequence.

In some embodiments, the two or more but less than the wild type number of human immunoglobulin light chain variable region gene segments are placed in a kappa light chain locus. In some embodiments, the two or more but less than the wild type number of human immunoglobulin light chain variable region gene segments are placed in a lambda light chain locus.

In some embodiments, the unrearranged human immunoglobulin V_(L) and J_(L) gene segments are human Vκ and Jκ gene segments. In some embodiments, the unrearranged human immunoglobulin V_(L) and J_(L) gene segments are human Vλ and Jλ gene segments.

In some embodiments, the two or more but less than the wild type number of human immunoglobulin light chain variable region gene segments comprise a human κ light chain variable region nucleotide sequence. In some embodiments, the two or more but less than the wild type number of human immunoglobulin light chain variable region gene segments comprise a human λ light chain variable region nucleotide sequence.

In some embodiments, the unrearranged human immunoglobulin light chain variable V_(L) gene segment is selected from the group consisting of Vκ1-5, Vκ1-6, Vκ1-8, Vκ1-9, Vκ1-12, Vκ1-13, Vκ1-16, Vκ1-17, Vκ1-22, Vκ1-27, Vκ1-32, Vκ1-33, Vκ1-35, Vκ1-37, Vκ1-39, Vκ1D-8, Vκ1D-12, Vκ1D-13, Vκ1D-16, Vκ1D-17, Vκ1D-22, Vκ1D-27, Vκ1D-32, Vκ1D-33, Vκ1D-35, Vκ1D-37, Vκ1D-39, Vκ1D-42, Vκ1D-43, Vκ1-NL1, Vκ2-4, Vκ2-10, Vκ2-14, Vκ2-18, Vκ2-19, Vκ2-23, Vκ2-24, Vκ2-26, Vκ2-28, Vκ2-29, Vκ2-30, Vκ2-36, Vκ2-38, Vκ2-40, Vκ2D-10, Vκ2D-14, Vκ2D-18, Vκ2D-19, Vκ2D-23, Vκ2D-24, Vκ2D-26, Vκ2D-28, Vκ2D-29, Vκ2D-30, Vκ2D-36, Vκ2D-38, Vκ2D-40, Vκ3-11, Vκ3-15, Vκ3-20, Vκ3-25, Vκ3-31, Vκ3-34, Vκ3D-7, Vκ3D-7, Vκ3D-11, Vκ3D-15, Vκ3D-15, Vκ3D-20, Vκ3D-25, Vκ3D-31, Vκ3D-34, Vκ3-NL1, Vκ3-NL2, Vκ3-NL3, Vκ3-NL4, Vκ3-NL5, Vκ4-1, Vκ5-2, Vκ6-21, Vκ6D-21, Vκ6D-41, and Vκ7-3. In some embodiments, the unrearranged human immunoglobulin J_(L) gene segment is selected from the group consisting of Jκ1, Jκ2, Jκ3, Jκ4, and Jκ5.

In some embodiments, the two or more but less than the wild type number of human immunoglobulin light chain variable region gene segments comprise V_(L) and J_(L) gene segments.

In some embodiments, the V_(L) gene segments of the two or more human immunoglobulin light chain variable region gene segments are selected from the group consisting of Vκ1-5, Vκ1-6, Vκ1-8, Vκ1-9, Vκ1-12, Vκ1-13, Vκ1-16, Vκ1-17, Vκ1-22, Vκ1-27, Vκ1-32, Vκ1-33, Vκ1-35, Vκ1-37, Vκ1-39, Vκ1D-8, Vκ1D-12, Vκ1D-13, Vκ1D-16, Vκ1D-17, Vκ1D-22, Vκ1D-27, Vκ1D-32, Vκ1D-33, Vκ1D-35, Vκ1D-37, Vκ1D-39, Vκ1D-42, Vκ1D-43, Vκ2-4, Vκ2-10, Vκ2-14, Vκ2-18, Vκ2-19, Vκ2-23, Vκ2-24, Vκ2-26, Vκ2-28, Vκ2-29, Vκ2-30, Vκ2-36, Vκ2-38, Vκ2-40, Vκ2D-10, Vκ2D-14, Vκ2D-18, Vκ2D-19, Vκ2D-23, Vκ2D-24, Vκ2D-26, Vκ2D-28, Vκ2D-29, Vκ2D-30, Vκ2D-36, Vκ2D-38, Vκ2D-40, Vκ3-7, Vκ3-11, Vκ3-15, Vκ3-20, Vκ3-25, Vκ3-31, Vκ3-34, Vκ3D-7, Vκ3D- 7, Vκ3D-11, Vκ3D-15, Vκ3D-15, Vκ3D-20, Vκ3D-25, Vκ3D-31, Vκ3D-34, Vκ3-NL1, Vκ3-NL2, Vκ3-NL3, Vκ3-NL4, Vκ3-NL5, Vκ4-1, Vκ5-2, Vκ6-21, Vκ6D-21, Vκ6D-41, and Vκ7-3. In some certain embodiments, the V_(L) gene segments are selected from the group consisting of Vκ1-39, Vκ3-20, and a combination thereof. In some embodiments, the J_(L) gene segment is selected from the group consisting of Jκ1, Jκ2, Jκ3, Jκ4, Jκ5, and a combination thereof.

In some embodiments, the two or more but less than wild type number of human immunoglobulin light chain variable region gene segments comprises two or more but less than wild type number of human V_(L) gene segments and one or more J_(L) gene segments. In some certain embodiments, the two or more but less than the wild type number of human V_(L) gene segments comprises Vκ1-29 and Vκ3-20 gene segments.

In some embodiments, the non-human animal is a rodent, and wherein the light chain constant region nucleic acid sequence is a rat or a mouse Cκ constant region nucleic acid sequence.

In some embodiments, the non-human animal is a rodent, and wherein the heavy chain constant region nucleic acid sequence is a rat or mouse constant region sequence selected from the group consisting of Igμ, Igδ, Igγ, Igε, Igα, each of which encodes a functional C_(H)1 domain.

In some embodiments, the two or more but less than the wild type number of human immunoglobulin light chain variable region gene segments are placed at an endogenous immunoglobulin light chain locus in the genome. In some embodiments, the two or more but less than the wild type number of human immunoglobulin light chain variable region gene segments are present in a germline genome of the non-human animal. In some embodiments, the two or more but less than the wild type number of human immunoglobulin light chain variable region gene segments are present at an ectopic locus in the genome.

In some embodiments, the non-human animal comprises an Adam6a gene, an Adam6b gene or both. In some embodiments, a non-human animal comprises a functional ectopic mouse Adam6 gene.

Other features, objects, and advantages of the present invention are apparent in the detailed description that follows. It should be understood, however, that the detailed description, while indicating embodiments of the present invention, is given by way of illustration only, not limitation. Various changes and modifications within the scope of the invention will become apparent to those skilled in the art from the detailed description.

BRIEF DESCRIPTION OF THE DRAWINGS

The Drawings included herein, which is comprised of the following Figures, is for illustration purposes only not for limitation.

FIG. 1 illustrates a schematic (not to scale) of the mouse heavy chain locus (at top), and a schematic (not to scale) of the human κ light chain locus (at bottom). The mouse heavy chain locus is about 3 Mb in length and contains approximately 200 heavy chain variable (V_(H)) gene segments, 13 heavy chain diversity (D_(H)) gene segments and 4 heavy chain joining (J_(H)) gene segments as well as enhancers (Enh) and heavy chain constant (C_(H)) regions. The human κ light chain locus is duplicated into distal and proximal contigs of opposite polarity spanning about 440 kb and 600 kb, respectively. Between the two contigs is about 800 kb of DNA that is believed to be free of Vκ gene segments. The human κ light chain locus contains about 76 Vκ gene segments, 5 Jκ gene segments, an intronic enhancer (Enh) and a single constant region (Cκ).

FIG. 2 shows a targeting strategy for progressive insertion of 40 human Vκ and 5 human Jκ gene segments into a mouse heavy chain locus. Hygromycin (hyg) and Neomycin (neo) selection cassettes are shown with recombinase recognition sites (R1, R2, etc.). A modified mouse heavy chain locus, e.g., a hybrid immunoglobulin locus comprising human Vκ and Jκ gene segments operably linked to mouse C_(H) regions, is shown at the bottom.

FIG. 3 shows an exemplary targeting strategy for progressive insertion of human Vλ and a human Jλ gene segment (or four human Jλ gene segments) into the mouse heavy chain locus. Hygromycin (hyg) and Neomycin (neo) selection cassettes are shown with recombinase recognition sites (R1, R2, etc.). A modified mouse heavy chain locus, e.g., a hybrid immunoglobulin locus, comprising human Vλ and Jλ gene segments (one or four) operably linked to mouse C_(H) regions is shown at the bottom.

FIG. 4 illustrates an exemplary targeting strategy for replacing endogenous mouse immunoglobulin κ light chain variable region gene segments with a rearranged human V_(L)/J_(L) sequence.

FIG. 5 illustrates two exemplary targeting vectors for replacing endogenous mouse immunoglobulin light chain Vκ and Jκ gene segments with a rearranged human Vκ1-39Jκ5 sequence (MAID 1633; SEQ ID NO:1) or a rearranged human Vκ3-2Jκ1 sequence (MAID 1635; SEQ ID NO:2).

FIG. 6 shows a modified mouse heavy chain locus, e.g., a hybrid immunoglobulin locus comprising human Vκ and Jκ gene segments operably linked to mouse C_(H) regions and a modified mouse κ light chain locus comprising a rearranged human VκJκ sequence. In one particular embodiment, a mouse having the modified heavy chain locus and modified κ light chain locus as shown (KOH×ULC) is created by breeding a “KOH” mouse and a “ULC” mouse.

FIG. 7 shows representative contour plots of bone marrow stained for B and T cells (top row; CD19⁺ and CD3⁺, respectively) and bone marrow gated on CD19⁺ and stained for ckit⁺ and CD43⁺ (bottom row) from a VELOCIMMUNE® mouse (VI3), a mouse homozygous for unrearranged human immunoglobulin light chain variable Vκ and Jκ gene segments at the heavy chain locus, homozygous for unrearranged human immunoglobulin Vκ and Jκ gene segments at the κ light chain locus and an integrated Adam6 gene (“KOH” mouse; 1994HO 1242HO), and a mouse homozygous for a rearranged light chain variable region nucleotide sequence at the κ light chain locus (either V_(K)3-20J_(K)1 or V_(K)1-39J_(K)5) and homozygous for unrearranged human immunoglobulin Vκ and Jκ gene segments at the heavy chain locus and an integrated Adam6 gene (1994HO 1635HO for V_(K)3-20J_(K)1; 1994 HO 1633HO for V_(K)1-39J_(K)5; “KOH×ULC” mouse). Pro and Pre B cells are noted on the contour plots. Percentage of cells within each gated region is shown.

FIG. 8 shows the total number of cells (top left), the total number of B (CD19⁺) cells (top, right), the number of Pro B cells (CD19⁺CD43⁺ckit⁺), and the number of Pre B cells (CD19⁺CD43⁻ckit⁻) in bone marrow isolated from the femurs of KOH×ULC mice (1994HO 1633HO:Vκ1-39Jκ5; 1994HO 1635HO:Vκ3-20Jκ1), KOH mice (1994HO 1242HO) and VELOCIMMUNE® humanized mice (VI3).

FIG. 9 shows representative contour plots (top row) of hone marrow gated on singlets stained for immunoglobulin M (IgM) and B220 from KOH×ULC mice (1994HO 1633HO, 1994HO 1635HO), a KOH mouse (1994HO 1242HO) and a VELOCIMMUNE® humanized mouse (VI3). Immature, mature and pro/pre B cells are noted on each of the contour plots. Percentage of cells within each gated region is shown; the bottom row shows the total number and immature B (left, IgM⁺B220^(int)) cells and mature B (IgM⁺B220^(hi)) in hone marrow isolated from the femurs of KOH×ULC mice (1994HO 1633HO:Vκ1-39Jκ5; 1994HO 1635HO:Vκ3-20Jκ1), KOH mice (1994HO 1242HO) and VELOCIMMUNE® humanized mice (VI3).

FIG. 10 shows representative contour plots (left column) of bone marrow gated on singlets stained for immunoglobulin M (IgM) and B220 from KOH×ULC mice (1994HO 1633HO, 1994HO 1635HO), a KOH mouse (1994HO 1242HO) and a VELOCIMMUNE® humanized mouse (VI3). Immature, mature and pro/pre B cells are noted on each of the contour plots; the right two columns shows representative contour plots of bone marrow gated on immature B cells (left, IgM⁺B220^(int)) and mature B cells (right, IgM⁺B220^(hi)) stained for Igκ and Igλ expression isolated from the femurs of KOH×ULC mice (1994HO 1633HO:Vκ1-39Jκ5; 1994HO 1635HO:Vκ3-20Jκ1), a KOH mouse (1994HO 1242HO) and VELOCIMMUNE® humanized mouse (VI3). Percentage of cells within each gated region is shown.

FIG. 11 shows representative contour plots of splenocytes stained for B and T cells (top row; CD19⁺ and CD3⁺, respectively) and splenocytes gated on CD19⁺ and stained for Igκ⁺ and Igλ⁺ expression from KOH×ULC mice (1994HO 1633HO:Vκ1-39Jκ5; 1994HO 1635HO:Vκ3-20Jκ1), a KOH mouse (1994HO 1242HO) and a VELOCIMMUNE® humanized mouse (VI3). Percentage of cells within each gated region is shown.

FIG. 12 shows the total number of B cells (CD19⁺), Igκ⁺ B cells (CD19⁺Kappa⁺) and Igλ+ B cells (CD19⁺Lambda⁺) in harvested spleens from KOH×ULC mice (1994HO 1633HO:Vκ1-39Jκ5; 1994HO 1635HO:Vκ3-20Jκ1), KOH mice (1994HO 1242HO) and VELOCIMMUNE® humanized mice (VI3).

FIG. 13 shows representative contour plots of splenocytes gated on CD19+ and stained for immunoglobulin D (IgD) and immunoglobulin M (IgM) from KOH×ULC mice (1994HO 1633HO:Vκ1-39Jκ5; 1994HO 1635HO:Vκ3-20Jκ1), a KOH mouse (1994HO 1242HO) and a VELOCIMMUNE® humanized mouse (VI3). Percentage of cells within each gated region is shown. Mature (CD19⁺IgM^(lo)IgD^(hi)) and transitional/immature (CD19⁺IgD^(int)IgM^(hi)) B cells are noted in each contour plot. Percentage of cells within each gated region is shown.

FIG. 14 shows the absolute number of splenocytes (top left), the total number of B cells (top right; CD19⁺), Transitional B cells (bottom left; CD19⁺IgD^(lo) IgM^(hi)), and mature B cells)(CD19⁺IgD^(hi)IgM^(lo) in harvested spleens from KOH×ULC mice (1994HO 1633HO:Vκ1-39Jκ5; 1994HO 1635HO:Vκ3-20Jκ1), a KOH mouse (1994HO 1242HO) and a VELOCIMMUNE® humanized mouse (VI3).

FIG. 15 shows representative contour plots of the peripheral B cell development KOH×ULC mice (1994HO 1633HO:Vκ1-39Jκ5; 1994HO 1635HO:Vκ3-20Jκ1), a KOH mouse (1994HO 1242HO) and a VELOCIMMUNE® humanized mouse (VI3). The first column (left) of contour plots show CD93⁺ and B220⁺ splenocytes gated on CD19⁺ indicating immature and mature B cells. The second column (middle) of contour plot shows IgM⁺ and CD23⁺ expression in immature B cells indicating T1 (IgD-IgM⁺CD21^(lo)CD23⁻), T2 (IgD^(hi)IgM^(hi)CD21^(mid)CD23⁺) and T3 B cell populations. The third column (right) of contour plots shows CD21⁺ (CD35⁺) and IgM⁺ expression of mature B cells indicating a smaller first population that give rise to marginal zone B cells and a second population that gives rise to follicular (FO) B cells. Percentage of cells within each gated region is shown.

FIG. 16 shows anti-Antigen 1 antibody titers in different KOH×ULC mice following immunization, resting phase and boost protocols. KOH×ULC mice mount a strong, high titer antigen-specific antibody response comparable to VI3 and ULC mice following a resting phase and additional boosts. Mice were immunized by the footpad route. The 2^(nd) bleed is following six boosts; 3^(rd) bleed is following four additional boosts. Mice were resting for 4.5 weeks after the 2^(nd) bleed. Antigen 1 is a carrier protein. VI3 mice are disclosed in U.S. Pat. No. 8,642,835, herein incorporated by reference. 1633 is a ULC mouse with Vκ1-39. 1635 is a ULC mouse with Vκ3-20.

FIG. 17 shows a schematic of antigen specific binding protein Fabs constructed from KOH×ULC mice. Antigen positive B cells were sorted from two KOH×ULC mice following immunization protocol with immunogen Antigen 2. Positive KOH variable domains were cloned into Fab plasmids. KOH variable domains were cloned into heavy constants. Transient transfections were carried out to produce protein for antigen positive screening by ELISA. KOH-CH1 Fab was transfected with human Vκ3-20 germ line (GL) ULC. Fabs were assayed by ELISA and BIACORE™ for binding to Antigen 2, cell surface protein. A number of antigen specific binders were identified by ELISA and BIACORE™ assays. Fourteen (14) samples bound antigen 2 at neutral pH as determined by ELISA. Binding was confirmed by BIACORE™ for 13 of the 14 ELISA binders.

FIG. 18 shows a Table with BIACORE™ binding data for representative V_(L) domains that retain binding to Antigen 2 when paired with a V_(H×ULC) domain from an antibody to an unrelated enzyme, anti-Antigen 3 antibody. The data shows that binding proteins from KOH×ULC mice have specificity solely in a single V_(L) domain.

FIG. 19 shows schematic representation of different multispecific antigen-binding protein formats. (A) shows a schematic of a the generation of a multispecific antigen-binding protein comprising (1) a first heavy chain that has a human V_(κ) (hVκ_(/CH×ULC)) domain fused with a human heavy chain constant region, the hVκ_(/CH×ULC) being cognate to a first universal light chain variable domain and capable of binding a first antigen A (Ag A) and (2) a second heavy chain that has a human V_(H) (hV_(H×ULC)) domain fused with a second human heavy chain constant region, the hV_(H×ULC) domain being cognate to a second universal light chain variable domain and capable of binding a second antigen B (Ag B), each of which heavy chains is paired with an identical universal light chain that comprises a third universal light chain variable domain fused with a human light chain constant region, wherein the third universal light chain is encoded by a rearranged V_(L)/J_(L) gene sequence from which the first and second universal light chains were derived. The hVκ_(/CH×ULC) domain of the final multispecific antigen binding protein is derived from antigen-binding protein raised against antigen A in a KOH×ULC mouse, which generates the hVκ_(/CH×ULC) domain fused to a mouse heavy chain constant region and paired with a universal light chain comprising a human universal light chain variable domain fused with a mouse light chain constant domain. The hV_(H×ULC) domain of the final multispecific antigen binding protein is derived from antigen-binding protein raised against antigen B in a ULC mouse, which generates the hV_(H×ULC) domain fused to a mouse heavy chain constant region and paired with a universal light chain comprising a human universal light chain variable domain fused with a mouse light chain constant domain. (B) shows a schematic of a the generation of a multispecific antigen-binding protein comprising (1) a first heavy chain that has a first human V_(κ) (hVκ_(/CH×ULC)) domain fused with a human heavy chain constant region, the first hVκ_(/CH×ULC) being cognate to a first universal light chain variable domain and capable of binding a first antigen A (Ag A) and (2) a second heavy chain that has a second human V (hVκ_(/CH×ULC)) domain fused with a second human heavy chain constant region, the second hVκ_(/CH×ULC) domain being cognate to a second universal light chain variable domain and capable of binding a second antigen B (Ag B), each of which heavy chains is paired with an identical universal light chain that comprises a third universal light chain variable domain fused with a human light chain constant region, wherein the third universal light chain is encoded by a rearranged V_(L)/J_(L) gene sequence from which the first and second universal light chains were derived. The first hVκ_(/CH×ULC) domain of the final multispecific antigen binding protein is derived from antigen-binding protein raised against antigen A in a KOH×ULC mouse, which generates the first hVκ_(/CH×ULC) domain fused to a mouse heavy chain constant region and paired with a universal light chain comprising a human universal light chain variable domain fused with a mouse light chain constant domain. The second hVκ_(/CH×ULC) domain of the final multispecific antigen binding protein is derived from antigen-binding protein raised against antigen B in a KOH×ULC mouse (e.g., second KOH×ULC mouse), which generates the second hVκ_(/CH×ULC) domain fused to a mouse heavy chain constant region and paired with a universal light chain comprising a human universal light chain variable domain fused with a mouse light chain constant domain.

FIG. 20 shows a Table with BIACORE™ binding data for representative antigen-binding proteins having a structure depicted in FIG. 19A (B1-B3). Binding data for control antibodies (C_(KOH)1-C_(KOH)2, C_(VH), C_(I), and C) are also included. NT=not tested, NA=not applicable, NB=not bound.

DEFINITIONS

This invention is not limited to particular methods, and experimental conditions described, as such methods and conditions may vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting, since the scope of the present invention is defined by the claims.

Unless defined otherwise, all terms and phrases used herein include the meanings that the terms and phrases have attained in the art, unless the contrary is clearly indicated or clearly apparent from the context in which the term or phrase is used. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, particular methods and materials are now described. All publications mentioned are hereby incorporated by reference.

The term “antibody”, as used herein, includes immunoglobulin molecules comprising four polypeptide chains, two heavy (H) chains and two light (L) chains inter-connected by disulfide bonds. Each heavy chain comprises a heavy chain variable domain and a heavy chain constant region (CH). The heavy chain constant region comprises several domains, e.g., C_(H)1, a hinge region, C_(H)2, C_(H)3 and, optionally C_(H)4. Each light chain comprises a light chain variable domain and a light chain constant region (CL). The heavy chain and light chain variable domains can be further subdivided into regions of hypervariability, termed complementarity determining regions (CDR), interspersed with regions that are more conserved, termed framework regions (FR). Each heavy and light chain variable domain comprises three CDRs and four FRs, arranged from amino-terminus to carboxy-terminus in the following order: FR1, CDR1, PR2, CDR2, FR3, CDR3, FR4 (heavy chain CDRs may be abbreviated as HCDR1, HCDR2 and HCDR3; light chain CDRs may be abbreviated as LCDR1, LCDR2 and LCDR3). The term “high affinity” antibody refers to an antibody that has a K_(D) with respect to its target epitope about of 10-9 M or lower (e.g., about 1×10⁻⁹ M, 1×10⁻¹⁰ M, 1×10⁻¹¹ M, or about 1×10⁻¹² M). In one embodiment, K_(D) is measured by surface plasmon resonance, e.g., BIACORE™; in another embodiment, K_(D) is measured by ELISA.

The term “biologically active” as used herein includes a characteristic of any agent that has activity in a biological system, in vitro or in vivo (e.g., in an organism). For instance, an agent that, when present in an organism, has a biological effect within that organism, is considered to be biologically active. In particular embodiments, where a protein or polypeptide is biologically active, a portion of that protein or polypeptide that shares at least one biological activity of the protein or polypeptide is typically referred to as a “biologically active” portion.

The phrase “antigen-binding protein” includes a mono-specific, a bi-specific or higher order antigen-binding protein that respectively and selectively binds one, two or more antigenic determinants Bispecific antigen-binding proteins generally comprise two nonidentical binding components, with each binding component specifically binding a different epitopeeither on two different molecules (e.g., different epitopes on two different immunogens) or on the same molecule (e.g., different epitopes on the same immunogen). If a bispecific antigen-binding protein is capable of selectively binding two different epitopes (a first epitope and a second epitope), the affinity of the first binding component for the first epitope will generally be at least one to two or three or four or more orders of magnitude lower than the affinity of the first binding component for the second epitope, and vice versa. Epitopes specifically bound by a bispecific antigen-binding protein can be on the same or a different target (e.g., on the same or a different protein). Exemplary bispecific antigen-binding protein include those with a first binding component specific for a tumor antigen and a second binding component specific for a cytotoxic marker, e.g., an Fc receptor (e.g., FcγRI, FcγRII, FcγRIII, etc.) or a T cell marker (e.g., CD3, CD28, etc.). Further, the second binding component can be substituted with a binding component having a different desired specificity. For example, a bispecific antigen-binding protein with a first binding component specific for a tumor antigen and a second binding component specific for a toxin can be paired so as to deliver a toxin (e.g., saporin, vinca alkaloid, etc.) to a tumor cell. Other exemplary bispecific antigen-binding protein include those with a first binding component specific for an activating receptor (e.g., B cell receptor, FcγRI, FcγRIIA, FcγRIIIA, FcγRI, FcεRI, T cell receptor, etc.) and a second binding component specific for an inhibitory receptor (e.g., FcγRIIB, CD5, CD22, CD72, CD300a, etc.). Such bispecific antigen-binding proteins can be constructed for therapeutic conditions associated with cell activation (e.g. allergy and asthma). Bispecific antigen-binding proteins can be made, for example, by combining binding components that recognize different epitopes of the same immunogen. For example, nucleic acid sequences encoding binding components (e.g., light or heavy chain variable sequences) that recognize different epitopes of the same immunogen can be fused to nucleic acid sequences encoding the same or different heavy chain constant regions, the same or different light chains, or respectively a heavy chain constant region and a light chain constant region, and such sequences can be expressed in a cell as a multispecific antigen-binding protein in a format that is similar to a Fab structure, scFab structure, a diabody structure, an scFv structure, an scFv-Fc structure, an scFv-zipper structure, or a tetrameric structure similar to a typical antibody that includes the cognate universal light chain. An exemplary bispecific antigen-binding protein has two heavy chains each having three light chain CDRs, followed by (N-terminal to C-terminal) a C_(H)1 domain, a hinge, a C_(H)2 domain, and a C_(H)3 domain, and an immunoglobulin light chain that either does not confer epitope-binding specificity but that can associate with each light chain, or that can associate with each light chain and that can bind one or more of the epitopes bound by the light chain epitope-binding regions, or that can associate with each light chain and enable binding of one or both of the light chains to one or both epitopes. Similarly, the term “trispecific antibody” includes an antigen-binding protein capable of selectively binding three or more epitopes.

The phrase “complementarity determining region,” or the term “CDR,” includes an amino acid sequence encoded by a nucleic acid sequence of an organism's immunoglobulin genes that normally (i.e., in a wild-type animal) appears between two framework regions in a variable region of a light or a heavy chain of an immunoglobulin molecule (e.g., an antibody or a T cell receptor). A CDR can be encoded by, for example, a germ line sequence or a rearranged or unrearranged sequence, and, for example, by a naive or a mature B cell or a T cell. A CDR can be somatically mutated (e.g., vary from a sequence encoded in an animal's germ line), humanized, and/or modified with amino acid substitutions, additions, or deletions. In some circumstances (e.g., for a CDR3), CDRs can be encoded by two or more sequences (e.g., germ line sequences) that are not contiguous (e.g., in an unrearranged nucleic acid sequence) but are contiguous in a B cell nucleic acid sequence, e.g., as the result of splicing or connecting the sequences (e.g., V-D-J recombination to form a heavy chain CDR3).

The term “comparable”, as used herein, includes two or more agents, entities, situations, sets of conditions, etc. that may not be identical to one another but that are sufficiently similar to permit comparison there between so that conclusions may reasonably be drawn based on differences or similarities observed. Those of ordinary skill in the art will understand, in context, what degree of identity is required in any given circumstance for two or more such agents, entities, situations, sets of conditions, etc. to be considered comparable.

The term “conservative” as used herein to describe a conservative amino acid substitution includes substitution of an amino acid residue by another amino acid residue having a side chain R group with similar chemical properties (e.g., charge or hydrophobicity). In general, a conservative amino acid substitution will not substantially change the functional properties of interest of a protein, for example, the ability of a receptor to bind to a ligand. Examples of groups of amino acids that have side chains with similar chemical properties include aliphatic side chains such as glycine, alanine, valine, leucine, and isoleucine; aliphatic-hydroxyl side chains such as serine and threonine; amide-containing side chains such as asparagine and glutamine; aromatic side chains such as phenylalanine, tyrosine, and tryptophan; basic side chains such as lysine, arginine, and histidine; acidic side chains such as aspartic acid and glutamic acid; and, sulfur-containing side chains such as cysteine and methionine. Conservative amino acids substitution groups include, for example, valine/leucine/isoleucine, phenylalanine/tyrosine, lysine/arginine, alanine/valine, glutamate/aspartate, and asparagine/glutamine. In some embodiments, a conservative amino acid substitution can be substitution of any native residue in a protein with alanine, as used in, for example, alanine scanning mutagenesis. In some embodiments, a conservative substitution is one that that has a positive value in the PAM250 log-likelihood matrix disclosed in Gonnet et al. (1992) Exhaustive Matching of the Entire Protein Sequence Database, Science 256:1443-45, hereby incorporated by reference. In some embodiments, a substitution is deemed to be “moderately conservative” if it has a nonnegative value in the PAM250 log-likelihood matrix.

In some embodiments, residue positions in an immunoglobulin light chain or heavy chain differ by one or more conservative amino acid substitutions. In some embodiments, residue positions in an immunoglobulin light chain or functional fragment thereof (e.g., a fragment that allows expression and secretion from, e.g., a B cell) are not identical to a light chain whose amino acid sequence is listed herein, but differs by one or more conservative amino acid substitutions.

The term “disruption” as used herein includes the result of an event that interrupts (e.g., via homologous recombination) a DNA. In some embodiments, a disruption may achieve or represent a deletion, insertion, inversion, modification, replacement, substitution, or any combination thereof, of a DNA sequence(s). In some embodiments, a disruption may achieve or represent introduction of a mutation, such as a missense, nonsense, or frame-shift mutation, or any combination thereof, in a coding sequence(s) in DNA. In some embodiments, a disruption may occur in a gene or gene locus endogenous to a cell. In some embodiments, insertions may include the insertion of entire genes or fragments of genes, e.g. exons, into an endogenous site in a cell or genome. In some embodiments, insertions may introduce sequences that are of an origin other than that of an endogenous sequence into which they are inserted. In some embodiments, a disruption may increase expression and/or activity of a gene or gene product (e.g., of a protein encoded by a gene). In some embodiments, a disruption may decrease expression and/or activity of a gene or gene product. In some embodiments, a disruption may alter sequence of a gene or gene product (e.g., an encoded protein). In some embodiments, a disruption may truncate or fragment a gene or gene product (e.g., an encoded protein). In some embodiments, a disruption may extend a gene or gene product; in some such embodiments, a disruption may achieve assembly of a fusion protein. In some embodiments, a disruption may affect level but not activity of a gene or gene product. In some embodiments, a disruption may affect activity but not level of a gene or gene product. In some embodiments, a disruption may have no significant effect on level of a gene or gene product. In some embodiments, a disruption may have no significant effect on activity of a gene or gene product. In some embodiments, a disruption may have no significant effect on either level or activity of a gene or gene product.

The phrase “endogenous locus” or “endogenous gene” as used herein includes a genetic locus found in a parent or reference organism prior to introduction of a disruption (e.g., deletion, insertion, inversion, modification, replacement, substitution, or a combination thereof as described herein). In some embodiments, an endogenous locus has a sequence found in nature. In some embodiments, an endogenous locus is wild type. In some embodiments, a reference organism that contains an endogenous locus as described herein is a wild-type organism. In some embodiments, a reference organism that contains an endogenous locus as described herein is an engineered organism. In some embodiments, a reference organism that contains an endogenous locus as described herein is a laboratory-bred organism (whether wild-type or engineered).

The phrase “endogenous promoter” includes a promoter that is naturally associated, e.g., in a wild-type organism, with an endogenous gene.

The phrase “epitope-binding protein” includes a protein having at least one CDR and that is capable of selectively recognizing an epitope, e.g., is capable of binding an epitope with a KD that is at about one micromolar or lower (e.g., a K_(D) that is about 1×10⁻⁶ M, 1×10⁷ M, 1×10⁻⁸ M, 1×10⁻⁹ M, 1×10¹⁰ M, 1×10⁻¹¹ M, or about 1×10⁻¹² M). Therapeutic epitope-binding proteins (e.g., therapeutic antibodies) frequently require a K_(D) that is in the nanomolar or the picomolar range.

“Functional” as used herein, e.g., in reference to a functional polypeptide, includes a polypeptide that retains at least one biological activity normally associated with the native protein. In another instance, a functional immunoglobulin gene segment may include a variable gene segment that is capable of productive rearrangement to generate a rearranged immunoglobulin gene sequence.

The phrase “functional fragment” includes fragments of epitope-binding proteins that can be expressed, secreted, and specifically bind to an epitope with a K_(D) in the micromolar, nanomolar, or picomolar range. Specific recognition includes having a K_(D) that is at least in the micromolar range, the nanomolar range, or the picomolar range.

The term “germ line” in reference to an immunoglobulin nucleic acid sequence includes a nucleic acid sequence that can be passed to progeny.

The term “heterologous” as used herein includes an agent or entity from a different source. For example, when used in reference to a polypeptide, gene, or gene product present in a particular cell or organism, the term clarifies that the relevant polypeptide, gene, or gene product 1) was engineered by the hand of man; 2) was introduced into the cell or organism (or a precursor thereof) through the hand of man (e.g., via genetic engineering); and/or 3) is not naturally produced by or present in the relevant cell or organism (e.g., the relevant cell type or organism type).

The term “host cell”, as used herein, includes a cell into which a heterologous (e.g., exogenous) nucleic acid or protein has been introduced. Persons of skill upon reading this disclosure will understand that such terms refer not only to a particular subject cell, but also are used to refer to progeny of that cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still understood by those skilled in the art to be included within the scope of the term “host cell” as used herein. In some embodiments, a host cell is or comprises a prokaryotic or eukaryotic cell. In general, a host cell is any cell that is suitable for receiving and/or producing a heterologous nucleic acid or protein, regardless of the Kingdom of life to which the cell is designated. Exemplary cells that may be utilized as host cells in accordance with the present disclosure include those of prokaryotes and eukaryotes (single-cell or multiple-cell), bacterial cells (e.g., strains of E. coli, Bacillus spp., Streptomyces spp., etc.), mycobacteria cells, fungal cells, yeast cells (e.g., S. cerevisiae, S. pombe, P. pastoris, P. methanolica, etc.), plant cells, insect cells (e.g., SF-9, SF-21, baculovirus-infected insect cells, Trichoplusia ni, etc.), non-human animal cells, human cells, or cell fusions such as, for example, hybridomas or quadromas. In some embodiments, the cell is a human, monkey, ape, hamster, rat, or mouse cell. In some embodiments, the cell is eukaryotic and is selected from the following cells: CHO (e.g., CHO K1, DXB-11 CHO, Veggie-CHO), COS (e.g., COS-7), retinal cell, Vero, CV1, kidney (e.g., HEK293, 293 EBNA, MSR 293, MDCK, HaK, BHK), HeLa, HepG2, WI38, MRC 5, Colo205, HB 8065, HL-60, (e.g., BHK21), Jurkat, Daudi, A431 (epidermal), CV-1, U937, 3T3, L cell, C127 cell, SP2/0, NS-0, MMT 060562, Sertoli cell, BRL 3A cell, HT1080 cell, myeloma cell, tumor cell, and a cell line derived from an aforementioned cell. In some embodiments, the cell comprises one or more viral genes, e.g., a retinal cell that expresses a viral gene (e.g., a PER.C6™ cell). In some embodiments, a host cell is or comprises an isolated cell. In some embodiments, a host cell is part of a tissue. In some embodiments, a host cell is part of an organism.

The term “humanized” is used herein in accordance with its art-understood meaning and includes nucleic acids or proteins whose structures (i.e., nucleotide or amino acid sequences) include portions that correspond substantially or identically with versions of the relevant nucleic acids or proteins that are found in nature in non-human animals and that are distinguishable from corresponding versions that are found in nature in humans, and also include portions whose structures differ from those present in the non-human-animal versions and instead correspond more closely with comparable structures found in the human versions. In some embodiments, a “humanized” gene is one that encodes a polypeptide having substantially the amino acid sequence as that of a human polypeptide (e.g., a human protein or portion thereof e.g., characteristic portion thereof). To give but one example, in the case of a membrane receptor, a “humanized” gene may encode a polypeptide with an extracellular portion whose amino acid sequence is identical or substantially identical to that of a human extracellular portion, and whose remaining sequence is identical or substantially identical to that of a non-human (e.g., mouse) polypeptide. In some embodiments, a humanized gene comprises at least a portion of a DNA sequence of a human gene. In some embodiment, a humanized gene comprises an entire DNA sequence found in a human gene. In some embodiments, a humanized protein has an amino acid sequence that comprises a portion that appears in a human protein. In some embodiments, a humanized protein has an amino acid sequence whose entire sequence is found in a human protein. In some embodiments (including, for example, some in which a humanized protein has an amino acid sequence whose entire sequence is found in a human protein), a humanized protein is expressed from an endogenous locus of a non-human animal, which endogenous locus corresponds to the homolog or ortholog of the relevant human gene encoding the protein.

The term “identity” as used herein in connection with a comparison of sequences, includes identity as determined by any of a number of different algorithms known in the art that can be used to measure nucleotide and/or amino acid sequence identity. In some embodiments, identities as described herein are determined using a ClustalW v. 1.83 (slow) alignment employing an open gap penalty of 10.0, an extend gap penalty of 0.1, and using a Gonnet similarity matrix (MACVECTOR™ 10.0.2, MacVector Inc., 2008). As used herein, the term “identity” refers to the overall relatedness between polymeric molecules, e.g., between nucleic acid molecules (e.g., DNA molecules and/or RNA molecules) and/or between polypeptide molecules. In some embodiments, polymeric molecules are considered to be “substantially identical” to one another if their sequences are at least 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or 99% identical. As will be understood by those skilled in the art, a variety of algorithms are available that permit comparison of sequences in order to determine their degree of homology, including by permitting gaps of designated length in one sequence relative to another when considering which residues “correspond” to one another in different sequences. Calculation of the percent identity between two nucleic acid sequences, for example, can be performed by aligning the two sequences for optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a second nucleic acid sequences for optimal alignment and non-corresponding sequences can be disregarded for comparison purposes). In certain embodiments, the length of a sequence aligned for comparison purposes is at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or substantially 100% of the length of the reference sequence. The nucleotides at corresponding nucleotide positions are then compared. When a position in the first sequence is occupied by the same nucleotide as the corresponding position in the second sequence, then the molecules are identical at that position. The percent identity between the two sequences is a function of the number of identical positions shared by the sequences, taking into account the number of gaps, and the length of each gap, which needs to be introduced for optimal alignment of the two sequences. Representative algorithms and computer programs useful in determining the percent identity between two nucleotide sequences include, for example, the algorithm of Meyers and Miller (CABIOS, 1989, 4: 11-17), which has been incorporated into the ALIGN program (version 2.0) using a PAM120 weight residue table, a gap length penalty of 12 and a gap penalty of 4. The percent identity between two nucleotide sequences can, alternatively, be determined for example using the GAP program in the GCG software package using an NWSgapdna.CMP matrix.

The term “isolated”, as used herein, includes a substance and/or entity that has been (1) separated from at least some of the components with which it was associated when initially produced (whether in nature and/or in an experimental setting), and/or (2) designed, produced, prepared, and/or manufactured by the hand of man. Isolated substances and/or entities may be separated from about 10%, about 20%, about 30%, about 40%, about 50%, about 60%, about 70%, about 80%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, about 99%, or more than about 99% of the other components with which they were initially associated. In some embodiments, isolated agents are about 80%, about 85%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, about 99%, or more than about 99% pure. As used herein, a substance is “pure” if it is substantially free of other components. In some embodiments, as will be understood by those skilled in the art, a substance may still be considered “isolated” or even “pure”, after having been combined with certain other components such as, for example, one or more carriers or excipients (e.g., buffer, solvent, water, etc.); in such embodiments, percent isolation or purity of the substance is calculated without including such carriers or excipients. To give but one example, in some embodiments, a biological polymer such as a polypeptide or polynucleotide that occurs in nature is considered to be “isolated” when, a) by virtue of its origin or source of derivation is not associated with some or all of the components that accompany it in its native state in nature; b) it is substantially free of other polypeptides or nucleic acids of the same species from the species that produces it in nature; c) is expressed by or is otherwise in association with components from a cell or other expression system that is not of the species that produces it in nature. Thus, for instance, in some embodiments, a polypeptide that is chemically synthesized or is synthesized in a cellular system different from that which produces it in nature is considered to be an “isolated” polypeptide. Alternatively or additionally, in some embodiments, a polypeptide that has been subjected to one or more purification techniques may be considered to be an “isolated” polypeptide to the extent that it has been separated from other components a) with which it is associated in nature; and/or b) with which it was associated when initially produced.

The phrase “light chain” includes an immunoglobulin light chain sequence from any organism, and unless otherwise specified includes human κ and λ light chains and a VpreB, as well as surrogate light chains. Light chain variable domains typically include three light chain CDRs and four framework (FR) regions, unless otherwise specified. Generally, a full-length light chain includes, from amino terminus to carboxyl terminus, a variable domain that includes FR1-CDR1-FR2-CDR2-FR3-CDR3-FR4, and a light chain constant region. A light chain variable domain is encoded by a light chain variable region gene sequence, which generally comprises V_(L) and J_(L) segments, derived from a repertoire of V and J segments present in the germ line. Sequences, locations and nomenclature for V and J light chain segments for various organisms can be found in IMGT database, www.imgt.org. Light chains include those, e.g., that do not selectively bind either a first or a second epitope selectively bound by the epitope-binding protein in which they appear. Light chains also include those that bind and recognize, or assist the heavy chain or another light chain with binding and recognizing, one or more epitopes selectively bound by the epitope-binding protein in which they appear. Conrinrion or universal light chains include those derived from a human Vκ1-39Jκ gene or a human Vκ3-20Jκ gene, and include somatically mutated (e.g., affinity matured) versions of the same. Exemplary human V_(L) segments include a human Vκ1-39 gene segment, a human Vκ3-20 gene segment, a human Vλ1-40 gene segment, a human Vλ1-44 gene segment, a human Vλ2-8 gene segment, a human Vλ2-14 gene segment, and human Vλ3-21 gene segment, and include somatically mutated (e.g., affinity matured) versions of the same. Light chains can be made that comprise a variable domain from one organism (e.g., human or rodent, e.g., rat or mouse; or bird, e.g., chicken) and a constant region from the same or a different organism (e.g., human or rodent, e.g., rat or mouse; or bird, e.g., chicken).

“Neutral pH” includes pH between about 7.0 and about 8.0, e.g., pH between about 7.0 and about 7.4, e.g., between about 7.2 and about 7.4, e.g., physiological pH. “Acidic pH” includes pH of 6.0 or lower, e.g., pH between about 5.0 and about 6.0, pH between about 5.75 and about 6.0, e.g., pH of endosomal or lysosomal compartments.

The phrase “non-human animal” as used herein includes a vertebrate organism that is not a human. In some embodiments, a non-human animal is a cyclostome, a bony fish, a cartilaginous fish (e.g., a shark or a ray), an amphibian, a reptile, a mammal (e.g., a rodent, e.g., a mouse or a rat), or a bird (e.g., a chicken). In some embodiments, a non-human mammal is a primate, a goat, a sheep, a pig, a dog, a cow, or a rodent. In some embodiments, a non-human animal is a rodent such as a rat or a mouse.

The phrase “operably linked”, as used herein, includes a physical juxtaposition (e.g., in three-dimensional space) of components or elements that interact, directly or indirectly with one another, or otherwise coordinate with each other to participate in a biological event, which juxtaposition achieves or permits such interaction and/or coordination. To give but one example, a control sequence (e.g., an expression control sequence) in a nucleic acid is said to be “operably linked” to a coding sequence when it is located relative to the coding sequence such that its presence or absence impacts expression and/or activity of the coding sequence. In many embodiments, “operable linkage” involves covalent linkage of relevant components or elements with one another. Those skilled in the art will readily appreciate, however, that in some embodiments, covalent linkage is not required to achieve effective operable linkage. For example, in some embodiments, nucleic acid control sequences that are operably linked with coding sequences that they control are contiguous with the gene of interest. Alternatively or additionally, in some embodiments, one or more such control sequences acts in trans or at a distance to control a coding sequence of interest. In some embodiments, the term “expression control sequence” as used herein refers to polynucleotide sequences which are necessary and/or sufficient to effect the expression and processing of coding sequences to which they are ligated. In some embodiments, expression control sequences may be or comprise appropriate transcription initiation, termination, promoter and/or enhancer sequences; efficient RNA processing signals such as splicing and polyadenylation signals; sequences that stabilize cytoplasmic mRNA; sequences that enhance translation efficiency (e.g., Kozak consensus sequence); sequences that enhance protein stability; and/or, in some embodiments, sequences that enhance protein secretion. In some embodiments, one or more control sequences are preferentially or exclusively active in a particular host cell or organism, or type thereof. To give but one example, in prokaryotes, control sequences typically include promoter, ribosomal binding site, and transcription termination sequence; in eukaryotes, in many embodiments, control sequences typically include promoters, enhancers, and/or transcription termination sequences. Those of ordinary skill in the art will appreciate from context that, in many embodiments, the term “control sequences” refers to components whose presence is essential for expression and processing, and in some embodiments includes components whose presence is advantageous for expression (including, for example, leader sequences, targeting sequences, and/or fusion partner sequences).

The term “recombinant”, as used herein, includes polypeptides (e.g., B cell activating factor proteins as described herein) that are designed, engineered, prepared, expressed, created or isolated by recombinant means, such as polypeptides expressed using a recombinant expression vector transfected into a host cell, polypeptides isolated from a recombinant, combinatorial human polypeptide library (Hoogenboom H. R., (1997) TIB Tech. 15:62-70; Azzazy H., and Highsmith W. E., (2002) Clin. Biochem. 35:425-445; Gavilondo J. V., and Larrick J. W. (2002) BioTechniques 29:128-145; Hoogenboom H., and Chames P. (2000) Immunology Today 21:371-378), antibodies isolated from an animal (e.g., a mouse) that is transgenic for human immunoglobulin genes (see e.g., Taylor, L. D., et al. (1992) Nucl. Acids Res. 20:6287-6295; Kellermann S-A., and Green L. L. (2002) Current Opinion in Biotechnology 13:593-597; Little M. et al (2000) Immunology Today 21:364-370) or polypeptides prepared, expressed, created or isolated by any other means that involves splicing selected sequence elements to one another. In some embodiments, one or more of such selected sequence elements is found in nature. In some embodiments, one or more of such selected sequence elements is designed in silico. In some embodiments, one or more such selected sequence elements results from mutagenesis (e.g., in vivo or in vitro) of a known sequence element, e.g., from a natural or synthetic source. For example, in some embodiments, a recombinant polypeptide is comprised of sequences found in the genome of a source organism of interest (e.g., human, mouse, etc.). In some embodiments, a recombinant polypeptide has an amino acid sequence that resulted from mutagenesis (e.g., in vitro or in vivo, for example in a non-human animal), so that the amino acid sequences of the recombinant polypeptides are sequences that, while originating from and related to polypeptides sequences, may not naturally exist within the genome of a non-human animal in vivo.

The term “replacement” is used herein includes a process through which a “replaced” nucleic acid sequence (e.g., a gene) found in a host locus (e.g., in a genome) is removed from that locus and a different, “replacement” nucleic acid is located in its place. In some embodiments, the replaced nucleic acid sequence and the replacement nucleic acid sequences are comparable to one another in that, for example, they are homologous to one another and/or contain corresponding elements (e.g., protein-coding elements, regulatory elements, etc.). In some embodiments, a replaced nucleic acid sequence includes one or more of a promoter, an enhancer, a splice donor site, a splice receiver site, an intron, an exon, an untranslated region (UTR); in some embodiments, a replacement nucleic acid sequence includes one or more coding sequences. In some embodiments, a replacement nucleic acid sequence is a homolog of the replaced nucleic acid sequence. In some embodiments, a replacement nucleic acid sequence is an ortholog of the replaced sequence. In some embodiments, a replacement nucleic acid sequence is or comprises a human nucleic acid sequence. In some embodiments, including where the replacement nucleic acid sequence is or comprises a human nucleic acid sequence, the replaced nucleic acid sequence is or comprises a rodent sequence (e.g., a mouse sequence). The nucleic acid sequence so placed may include one or more regulatory sequences that are part of source nucleic acid sequence used to obtain the sequence so placed (e.g., promoters, enhancers, 5′- or 3′-untranslated regions, etc.). For example, in various embodiments, the replacement is a substitution of an endogenous sequence with a heterologous sequence that results in the production of a gene product from the nucleic acid sequence so placed (comprising the heterologous sequence), but not expression of the endogenous sequence; the replacement is of an endogenous genomic sequence with a nucleic acid sequence that encodes a protein that has a similar function as a protein encoded by the endogenous sequence (e.g., the endogenous genomic sequence encodes a variable domain, and the DNA fragment encodes one or more human variable domains). In various embodiments, an endogenous gene or fragment thereof is replaced with a corresponding human gene or fragment thereof. A corresponding human gene or fragment thereof is a human gene or fragment that is an ortholog of, or is substantially similar or the same in structure and/or function, as the endogenous gene or fragment thereof that is replaced.

The term “heavy chain only antibody,” “heavy chain only antigen binding protein,” “single domain antigen binding protein,” “single domain binding protein” or the like refers to a monomeric or homodimeric immunoglobulin molecule comprising an immunoglobulin-like chain comprising a variable domain operably linked to a heavy chain constant region, that is unable to associate with a light chain because the heavy chain constant region typically lacks a functional C_(H)1 domain. Accordingly, the term “heavy chain only antibody,” “heavy chain only antigen binding protein,” “single domain antigen binding protein,” “single domain binding protein” or the like encompasses a both (i) a monomeric single domain antigen binding protein comprising one of the immunoglobulin-like chain comprising a variable domain operably linked to a heavy chain constant region lacking a functional C_(H)1 domain, or (ii) a homodimeric single domain antigen binding protein comprising two immunoglobulin-like chains, each of which comprising a variable domain operably linked to a heavy chain constant region lacking a functional C_(H)1 domain. In various aspects, a homodimeric single domain antigen binding protein comprises two identical immunoglobulin-like chains, each of which comprising an identical variable domain operably linked to an identical heavy chain constant region lacking a functional C_(H)1 domain. Additionally, each immunoglobulin-like chain of a single domain antigen binding protein comprises a variable domain, which may be derived from heavy chain variable region gene segments (e.g., V_(H), D_(H), J_(H)), light chain gene segments (e.g., V_(L), J_(L)), or a combination thereof, linked to a heavy chain constant region (C_(H)) gene sequence comprising a deletion or inactivating mutation in a C_(H)1 encoding sequence (and, optionally, a hinge region) of a heavy chain constant region gene, e.g., IgG, IgA, IgE, IgD, or a combination thereof. A single domain antigen binding protein comprising a variable domain derived from heavy chain gene segments may be referred to as a “V_(H)-single domain antibody” or “V_(H)-single domain antigen binding protein”, see, e.g., U.S. Pat. No. 8,754,287; U.S. Patent Publication Nos. 20140289876; 20150197553; 20150197554; 20150197555; 20150196015; 20150197556 and 20150197557, each of which is incorporated in its entirety by reference. A single domain antigen binding protein comprising a variable domain derived from light chain gene segments may be referred to as a or “V_(L)-single domain antigen binding protein,” see, e.g., U.S. Publication No. 20150289489, incorporated in its entirety by reference.

“Somatically mutated” includes reference to nucleic acid or amino acid sequences from affinity-matured B cells that are not identical to corresponding immunoglobulin variable region sequences in B cells that are not affinity-matured (i.e., sequences in the genome of germline cells). The phrase “somatically mutated” also includes reference to an immunoglobulin variable region nucleic acid or amino acid sequence from a B cell after exposure of the B cell to an epitope of interest, wherein the nucleic acid or amino acid sequence differs from the corresponding nucleic acid or amino acid sequence prior to exposure of the B cell to the epitope of interest. The phrase “somatically mutated” refers to sequences from binding proteins that have been generated in an animal, e.g., a mouse having human immunoglobulin variable region nucleic acid sequences, in response to an immunogen challenge, and that result from the selection processes inherently operative in such an animal.

The term “substantially” as used herein includes the qualitative condition of exhibiting total or near-total extent or degree of a characteristic or property of interest. One of ordinary skill in the biological arts will understand that biological and chemical phenomena rarely, if ever, go to completion and/or proceed to completeness or achieve or avoid an absolute result. The term “substantially” is therefore used herein to capture the potential lack of completeness inherent in many biological and chemical phenomena.

The phrase “substantial homology” as used herein includes a comparison between amino acid or nucleic acid sequences. As will be appreciated by those of ordinary skill in the art, two sequences are generally considered to be “substantially homologous” if they contain homologous residues in corresponding positions. Homologous residues may be identical residues. Alternatively, homologous residues may be non-identical residues will appropriately similar structural and/or functional characteristics. For example, as is well known by those of ordinary skill in the art, certain amino acids are typically classified as “hydrophobic” or “hydrophilic” amino acids, and/or as having “polar” or “non-polar” side chains. Substitution of one amino acid for another of the same type may often be considered a “homologous” substitution. As is well known in this art, amino acid or nucleic acid sequences may be compared using any of a variety of algorithms, including those available in commercial computer programs such as BLASTN for nucleotide sequences and BLASTP, gapped BLAST, and PSI-BLAST for amino acid sequences. Exemplary such programs are described in Altschul, et al., Basic local alignment search tool, J. Mol. Biol., 215(3): 403-410, 1990; Altschul, et al., Methods in Enzymology; Altschul, et al., “Gapped BLAST and PSI-BLAST: a new generation of protein database search programs”, Nucleic Acids Res. 25:3389-3402, 1997; Baxevanis, et al., Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins, Wiley, 1998; and Misener, et al., (eds.), Bioinformatics Methods and Protocols (Methods in Molecular Biology, Vol. 132), Humana Press, 1999. In addition to identifying homologous sequences, the programs mentioned above typically provide an indication of the degree of homology. In some embodiments, two sequences are considered to be substantially homologous if at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more of their corresponding residues are homologous over a relevant stretch of residues. In some embodiments, the relevant stretch is a complete sequence. In some embodiments, the relevant stretch is at least 9, 10, 11, 12, 13, 14, 15, 16, 17 or more residues. In some embodiments, the relevant stretch includes contiguous residues along a complete sequence. In some embodiments, the relevant stretch includes discontinuous residues along a complete sequence. In some embodiments, the relevant stretch is at least 10, 15, 20, 25, 30, 35, 40, 45, 50, or more residues.

The phrase “substantial identity” as used herein includes a comparison between amino acid or nucleic acid sequences. As will be appreciated by those of ordinary skill in the art, two sequences are generally considered to be “substantially identical” if they contain identical residues in corresponding positions. As is well known in this art, amino acid or nucleic acid sequences may be compared using any of a variety of algorithms, including those available in commercial computer programs such as BLASTN for nucleotide sequences and BLASTP, gapped BLAST, and PSI-BLAST for amino acid sequences.

Exemplary such programs are described in Altschul, et al., Basic local alignment search tool, J. Mol. Biol., 215(3): 403-410, 1990; Altschul, et al., Methods in Enzymology; Altschul et al., Nucleic Acids Res. 25:3389-3402, 1997; Baxevanis et al., Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins, Wiley, 1998; and Misener, et al., (eds.), Bioinformatics Methods and Protocols (Methods in Molecular Biology, Vol. 132), Humana Press, 1999. In addition to identifying identical sequences, the programs mentioned above typically provide an indication of the degree of identity. In some embodiments, two sequences are considered to be substantially identical if at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or more of their corresponding residues are identical over a relevant stretch of residues. In some embodiments, the relevant stretch is a complete sequence. In some embodiments, the relevant stretch is at least 10, 15, 20, 25, 30, 35, 40, 45, 50, or more residues.

The phrase “targeting vector” or “targeting construct” as used herein includes a polynucleotide molecule that comprises a targeting region. A targeting region comprises a sequence that is identical or substantially identical to a sequence in a target cell, tissue or animal and provides for integration of the targeting construct into a position within the genome of the cell, tissue or animal via homologous recombination. Targeting regions that target using site-specific recombinase recognition sites (e.g., loxP or Frt sites) are also included. In some embodiments, a targeting construct of the present invention further comprises a nucleic acid sequence or gene of particular interest, a selectable marker, control and or regulatory sequences, and other nucleic acid sequences that allow for recombination mediated through exogenous addition of proteins that aid in or facilitate recombination involving such sequences. In some embodiments, a targeting construct of the present invention further comprises a gene of interest in whole or in part, wherein the gene of interest is a heterologous gene that encodes a protein in whole or in part that has a similar function as a protein encoded by an endogenous sequence.

The term “unrearranged,” with reference to a nucleic acid sequence, includes nucleic acid sequences that exist, e.g., in a wild-type germ line of an animal cell.

The phrase “variable domain” includes an amino acid sequence of an immunoglobulin light or heavy chain (modified as desired) that comprises the following amino acid regions, in sequence from N-terminal to C-terminal (unless otherwise indicated): FR1, CDR1, FR2, CDR2, FR3, CDR3, FR4.

The term “variant”, as used herein, includes an entity that shows significant structural identity with a reference entity but differs structurally from the reference entity in the presence or level of one or more chemical moieties as compared with the reference entity. In many embodiments, a variant also differs functionally from its reference entity. In general, whether a particular entity is properly considered to be a “variant” of a reference entity is based on its degree of structural identity with the reference entity. As will be appreciated by those skilled in the art, any biological or chemical reference entity has certain characteristic structural elements. A variant, by definition, is a distinct chemical entity that shares one or more such characteristic structural elements. To give but a few examples, a small molecule may have a characteristic core structural element (e.g., a macrocycle core) and/or one or more characteristic pendent moieties so that a variant of the small molecule is one that shares the core structural element and the characteristic pendent moieties but differs in other pendent moieties and/or in types of bonds present (single vs. double, E vs. Z, etc.) within the core, a polypeptide may have a characteristic sequence element comprised of a plurality of amino acids having designated positions relative to one another in linear or three-dimensional space and/or contributing to a particular biological function, a nucleic acid may have a characteristic sequence element comprised of a plurality of nucleotide residues having designated positions relative to on another in linear or three-dimensional space. For example, a variant polypeptide may differ from a reference polypeptide as a result of one or more differences in amino acid sequence and/or one or more differences in chemical moieties (e.g., carbohydrates, lipids, etc.) covalently attached to the polypeptide backbone. In some embodiments, a variant polypeptide shows an overall sequence identity with a reference polypeptide that is at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, or 99%. Alternatively or additionally, in some embodiments, a variant polypeptide does not share at least one characteristic sequence element with a reference polypeptide. In some embodiments, the reference polypeptide has one or more biological activities. In some embodiments, a variant polypeptide shares one or more of the biological activities of the reference polypeptide. In some embodiments, a variant polypeptide lacks one or more of the biological activities of the reference polypeptide. In some embodiments, a variant polypeptide shows a reduced level of one or more biological activities as compared with the reference polypeptide. In many embodiments, a polypeptide of interest is considered to be a “variant” of a parent or reference polypeptide if the polypeptide of interest has an amino acid sequence that is identical to that of the parent but for a small number of sequence alterations at particular positions. Typically, fewer than 20%, 15%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2% of the residues in the variant are substituted as compared with the parent. In some embodiments, a variant has 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 substituted residue as compared with a parent. Often, a variant has a very small number (e.g., fewer than 5, 4, 3, 2, or 1) number of substituted functional residues (i.e., residues that participate in a particular biological activity). Furthermore, a variant typically has not more than 5, 4, 3, 2, or 1 additions or deletions, and often has no additions or deletions, as compared with the parent. Moreover, any additions or deletions are typically fewer than about 25, about 20, about 19, about 18, about 17, about 16, about 15, about 14, about 13, about 10, about 9, about 8, about 7, about 6, and commonly are fewer than about 5, about 4, about 3, or about 2 residues. In some embodiments, the parent or reference polypeptide is one found in nature. As will be understood by those of ordinary skill in the art, a plurality of variants of a particular polypeptide of interest may commonly be found in nature, particularly when the polypeptide of interest is an infectious agent polypeptide.

The term “vector”, as used herein, includes a nucleic acid molecule capable of transporting another nucleic acid to which it is associated. In some embodiment, vectors are capable of extra-chromosomal replication and/or expression of nucleic acids to which they are linked in a host cell such as a eukaryotic and/or prokaryotic cell. Vectors capable of directing the expression of operatively linked genes are referred to herein as “expression vectors.”

The term “wild-type”, as used herein, includes an entity having a structure and/or activity as found in nature in a “normal” (as contrasted with mutant, diseased, altered, etc.) state or context. Those of ordinary skill in the art will appreciate that wild type genes and polypeptides often exist in multiple different forms (e.g., alleles).

DETAILED DESCRIPTION OF CERTAIN EMBODIMENTS

The present invention provides, among other things, improved and/or engineered non-human animals having human genetic material encoding light chain variable domains (e.g., V_(L) regions). In certain embodiments, such non-human animals are useful, for example, for the production and isolation of (human) V_(L) domains that bind antigen independently. It is contemplated that such non-human animals provide a novel in vivo system for the generation and affinity maturation of human V_(L) domains that exhibit unique antigen-binding characteristics. Therefore, the present invention is particularly useful for the development of unique antigen-binding proteins in non-human animals. In particular, the present invention encompasses the humanization of a rodent immunoglobulin loci resulting in expression of antigen-binding proteins that resemble naturally occurring immunoglobulins in structure yet differ in binding characteristics, and resulting in expression of said antigen-binding proteins on the membrane surface of cells of the non-human animal. Such antigen-binding proteins have the capacity to recognize foreign antigens that may elude natural immunoglobulins in the generation of unique binding surfaces provided by the antigen-binding proteins. In some embodiments, non-human animals of the present invention are capable of generating (human) V_(L/CH×ULC) domains that bind to antigen independent of a cognate variable domain (e.g., a heavy chain variable domain); in some embodiments, such non-human animals develop and/or have a B cell population that express binding proteins that resemble immunoglobulins in structure yet are devoid of any heavy chain variable sequences. In some embodiments, antigen-binding proteins expressed by such non-human animals are characterized in that the antigen-binding portion comprises exclusively of (human) V_(L×ULC) domains. In some embodiments, non-human animals of the present invention comprise an endogenous immunoglobulin heavy chain locus that contains genetic material from the non-human animal and a heterologous species (e.g., a human) and comprise an endogenous immunoglobulin light chain locus that contains genetic material from the non-human animal and a heterologous species (e.g., human). In some embodiments, non-human animals of the present invention comprise a hybrid immunoglobulin chain locus that includes unrearranged human V_(L) and J_(L) gene segments operably linked to a heavy chain constant region encoding sequence and an immunoglobulin light chain locus that includes a single rearranged human or non-human V_(L)J_(L) sequence. In some embodiments, the expression of the antigen-binding proteins is under the control of non-human immunoglobulin genetic material (e.g., a non-human immunoglobulin promoter and/or enhancer).

Various aspects of the invention are described in detail in the following sections. The use of sections is not meant to limit the invention. Each section can apply to any aspect of the invention.

Non-Human Animals Comprising a High Diversity Hybrid Chain Locus Containing Unrearranged Light Chain Variable Region Gene Segments and a Low Diversity Light Chain Locus Containing a Rearranged Light Chain Variable Region Sequence

Generation of light chain variable regions that have an ability to bind an antigen independently from a cognate chain variable region can be useful for making light chain variable domains (V_(L)s) for use in antigen-binding molecules.

One approach to produce such light chain variable domains that can bind to an antigen independently from a cognate chain variable region is to apply a selective pressure on nucleotide sequences that encode a variable region or domain of a light chain (V_(L)) to generate light chain CDR3s with more diverse antigenic binding repertoire. As disclosed herein, this can be achieved by generating a genetically modified non-human animal that contains, in its genome, an immunoglobulin hybrid chain locus that contains a high diversity of unrearranged light chain gene segments, see, e.g., U.S. Patent Publication No. 20120096572, incorporated herein by reference, and an immunoglobulin light chain locus that has a low diversity in that the locus contains a single rearranged human immunoglobulin light chain variable region nucleotide sequence. Alternatively, in some embodiments, non-human animals as described herein contain an immunoglobulin light chain locus that has a low diversity in that the locus contains two or more but less than the wild type number of unrearranged human V_(L) gene segments (e.g., 2, 3 or 4). Since the light chain sequence (or the limited number of V_(L) gene segments) at the immunoglobulin light chain locus is restricted to a common or universal (i.e., the same or a very similar) sequences in these animals, the unrearranged light chain variable region nucleotide sequences (i.e., genes) at the hybrid locus will be forced to make light chain CDR3s with more diverse and efficient antigenic binding properties, which can hind an antigenic determinant independently from the cognate variable regions. Furthermore, as disclosed herein, the precise replacement of germ line variable region gene segments (e.g., by homologous recombination-mediated gene targeting) allows for making animals (e.g., mice, rats, or chickens) that have partly human immunoglobulin loci. Because the partly human immunoglobulin loci rearrange, hypermutate, and somatically mutate (e.g., class switch) normally, the partly human immunoglobulin loci generate binding proteins in the animal that comprise human variable domains (i.e., human V_(L) domains). These animals exhibit a humoral immune system that is substantially similar to wild type animals, and display normal cell populations and normal lymphoid organ structureseven where the animals lack a full repertoire of human variable region gene segments (at an immunoglobulin light chain locus). Immunizing these animals (e.g., mice, rats, or chickens) results in robust humoral responses that display a wide diversity of light chain variable gene segment usage. Nucleotide sequences that encode the variable regions can be identified and cloned, then fused (e.g., in an in vitro system) with any sequences of choice, e.g., any immunoglobulin isotype suitable for a particular use, resulting in an antibody or antigen-binding protein derived wholly from human sequences.

In addition, by utilizing animals (e.g., mice or rats or chickens) that have a restricted (limited) immunoglobulin light chain locus, e.g., a restricted immunoglobulin light chain locus comprising a rearranged light chain variable region nucleotide sequence (e.g., a universal light chain or “ULC,” US Patent Application Publication No. 2011-0195454 A1, US 2012-0021409A1, US 2012-0192300A1, US 2013-0045492A1, US 2013-0185821A1 and US 2013-0302836A1, incorporated by reference herein in their entireties) or a restricted (limited) immunoglobulin light chain variable region gene segment repertoire (e.g., a restricted immunoglobulin light chain variable segment repertoire comprising two or more but less than the wild type number of human V_(L) gene segments; for example, a dual light chain, or “DLC”, U.S. Patent Application Publication No. US-2013-0198880-A1, incorporated by reference herein in its entirety) in combination with a high diversity hybrid immunoglobulin chain locus containing unrearranged light chain variable region gene segments described above, an immunoglobulin light chain variable (V_(L/CH×ULC)) domain that binds antigen in the absence of a heavy chain variable domain can be produced. Furthermore, by introducing histidine codons, e.g., via addition of one or more histidine codons or substitution of one or more non-histidine codons with histidine codons, into the rearranged light chain variable region nucleotide sequence (or into the limited V_(L) gene segments) in the genome of the non-human animals described herein, light chain variable region amino acid sequences that can confer improved pH-dependent recyclability to the antigen-binding proteins can be generated.

In some embodiments, the genetically modified non-human animals as described herein provide a greater yield of binding proteins, while limiting diversity at the same time, thereby increasing the probability of successful production of light chain variable domains from the hybrid locus that bind antigen independent of a cognate variable domain. In some embodiments, the light chains may themselves exhibit antigen-binding properties. In some embodiments, the non-human animal may be induced to produce antigen-binding proteins exhibiting antigen specificity that resides in their light chains (e.g., by limiting a mouse or rat's immunoglobulin light chain repertoire and maximizing the immunoglobulin hybrid chain repertoire; e.g., by creating a hybrid immunoglobulin chain repertoire, e.g., by replacing the mouse or rat heavy chain variable region locus with a locus comprising a high diversity of unrearranged human V_(L) and J_(L) gene segments and replacing the mouse or rat light chain variable region locus a single rearranged human immunoglobulin light chain variable region nucleotide sequence). In some embodiments, antigen-binding proteins (e.g., antibodies) produced in such animals will be specific for a particular epitope (e.g., effector antigens, cytotoxic molecules, Fc receptors, toxins, activating or inhibitory receptors, T cell markers, immunoglobulin transporters, etc.) through their light chain binding.

In various aspects, a non-human animal is provided comprising in its germ line genome a hybrid immunoglobulin chain locus that comprises unrearranged (human) V_(L) and J_(L) gene segments operably linked to a heavy chain constant region encoding sequence and an immunoglobulin light chain locus that comprises a rearranged human or non-human immunoglobulin light chain variable region nucleotide sequence (i.e., a rearranged light chain VJ sequence). In some embodiments, the unrearranged (human) V_(L) and J_(L) gene segments are operably linked to a human or non-human heavy chain constant region sequence comprising one or more heavy chain constant region genes, each of which encodes at least a functional C_(H)1 domain, and the rearranged (human) immunoglobulin light chain variable region nucleotide sequence is operably linked to a human or a non-human light chain constant region sequence. In some embodiments, an immunoglobulin light chain variable domain encoded by the rearranged light chain variable region nucleotide sequence is not immunogenic to the non-human animal. In some embodiments, the non-human animal is modified to comprise a nucleotide sequence that encodes two copies, three copies, four copies or more of the rearranged light chain variable domain operably linked to a light chain constant domain. In some embodiments, the nucleotide sequence encodes a plurality of copies of the rearranged (human) immunoglobulin light chain variable region nucleotide sequence. For example, the nucleotide sequence can encode at least one, two, three, four, five copies of the rearranged human immunoglobulin light chain variable region nucleotide sequence. In some embodiments, the nucleotide sequence encodes 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 copies of the rearranged (human) immunoglobulin light chain variable region nucleotide sequence. In some embodiments, the immunoglobulin light chain locus comprises a plurality of copies of the rearranged (human) inununoglobulin light chain variable region nucleotide sequence operably linked to a light chain constant region gene sequence.

In various aspects, the immunoglobulin light chain locus of the non-human animals described herein comprises a single rearranged human immunoglobulin light chain variable region nucleotide sequence, e.g., a rearranged human V_(L)/J_(L) sequence, operably linked to a non-human light chain constant region nucleotide sequence (e.g., a non-human light chain constant region nucleic acid sequence). Thus, genetically modified non-human animals are provided comprising in their genomes: (i) a hybrid immunoglobulin chain locus that comprises unrearranged human V_(L) and J_(L) gene segments operably linked to a human or non-human heavy chain constant region nucleic acid sequence; and (ii) an immunoglobulin light chain locus comprising a rearranged human light chain variable region nucleotide sequence operably linked to a light chain constant region nucleic acid sequence. In some embodiments, the light chain constant region is a rat or a mouse constant region, e.g., a rat or a mouse Cκ constant region. In some embodiments, the human V_(L) and J_(L) gene segments at an immunoglobulin heavy chain locus (e.g., hybrid immunoglobulin chain locus) are present as a plurality of gene segments (more than one human V_(L) and more than one human J_(L) gene segment) and capable of rearranging and encoding human V_(L) domains in the context of heavy chain constant regions of an antibody, and the non-human animal does not comprise an endogenous V_(H) and/or V_(L) gene segment. In some embodiments, the non-human animal comprises six, 16, 30, 40 or more unrearranged human Vκ gene segments at an immunoglobulin heavy chain locus (e.g., hybrid immunoglobulin chain locus). In some embodiments, the non-human animal comprises five unrearranged human Jκ gene segments, e.g., Jκ1, Jκ2, Jκ3, Jκ4, and Jκ5 gene segments at an immunoglobulin heavy chain locus (e.g., hybrid immunoglobulin chain locus). In some embodiments, the non-human animal comprises 12, 28, 40 or more unrearranged human Vλ gene segments at an immunoglobulin heavy chain locus (e.g., hybrid immunoglobulin chain locus). In some embodiments, the non-human animal comprises 1, 2, 3, 4 or more unrearranged human Jλ gene segments, e.g., Jλ1, Jλ2, Jλ3, Jλ7, etc., at an immunoglobulin heavy chain locus (e.g., hybrid immunoglobulin chain locus).

In some embodiments, the immunoglobulin light chain locus of the non-human animals described herein comprises a rearranged human VκJκ nucleotide sequence. In some embodiments, the immunoglobulin light chain locus comprises a rearranged human VλJλ nucleotide sequence. In some embodiments, the rearranged human VκJκ nucleotide sequence or rearranged human VλJλ nucleotide sequence is present at an endogenous light chain locus, e.g., at an endogenous κ light chain locus. In some embodiments, the mouse comprises a functional λ light chain locus. In some embodiments, the mouse comprises a non-functional λ light chain locus. In some embodiments, the one or more human V_(L) and one or more human J_(L) gene segments at the immunoglobulin heavy chain locus are operably linked to a mouse or a rat heavy chain constant region sequence (e.g., in a hybrid immunoglobulin chain locus). In some embodiments, the rearranged human VκJκ nucleotide sequence is a rearranged human Vκ1-39Jκ nucleotide sequence, e.g., Vκ1-39Jκ5 sequence (e.g., as set forth in SEQ ID NO:1). In some embodiments, the rearranged human VκJκ nucleotide sequence is a rearranged human Vλ3-20Jκ nucleotide sequence, e.g., Vκ3-20J sequence (e.g., as set forth in SEQ ID NO:2). In some embodiments, the rearranged human VλJλ nucleotide sequence is a rearranged human Vλ2-14Jλ1 nucleotide sequence. As persons of skill will recognize the use of other J_(L) sequences may be employed in a rearranged light chain sequence.

In various aspects, the immunoglobulin light chain locus of the non-human animals described herein comprises a limited repertoire of immunoglobulin light chain variable gene segments, e.g., one or more but less than the wild type number of human V_(L) gene segments; and one or more human J_(L) gene segments, operably linked to a non-human light chain constant region nucleotide sequence. Thus, genetically modified non-human animals are provided comprising in their genomes: (i) an immunoglobulin heavy chain locus (e.g., hybrid immunoglobulin chain locus) that comprises unrearranged human V_(L) and J_(L) gene segments operably linked to a human or non-human heavy chain constant region nucleic acid sequence (e.g., a non-human heavy chain constant region nucleic acid sequence encoding a CH1, hinge, CH2, CH3, CH4, or a combination thereof, e.g., a CH1, a hinge, an CH2, and a CH3); and (ii) an immunoglobulin light chain locus comprising two or more but less than the wild type number of human immunoglobulin V_(L) and J_(L) gene segments operably linked to a light chain constant region nucleic acid sequence. In some embodiments, the light chain constant region is a rat or a mouse constant region, e.g., a rat or a mouse Cκ constant region. In some embodiments, the human V_(L) and J_(L) gene segments at an immunoglobulin heavy chain locus (e.g., hybrid immunoglobulin chain locus) are present as a plurality of gene segments (more than one human V_(L) and more than one human J_(L) gene segment) and capable of rearranging and encoding human V_(L) domains in the context of heavy chain constant regions of an antibody, and the non-human animal does not comprise an endogenous V_(H) and/or V_(L) gene segment. In some embodiments, the non-human animal comprises six, 16, 30, 40 or more unrearranged human Vκ gene segments at an immunoglobulin heavy chain locus (e.g., hybrid immunoglobulin chain locus). In some embodiments, the non-human animal comprises five unrearranged human Jκ gene segments, e.g., Jκ1, Jκ2, Jκ3, Jκ4, and Jκ5 gene segments at an immunoglobulin heavy chain locus (e.g., hybrid immunoglobulin chain locus). In some embodiments, the non-human animal comprises 12, 28, 40 or more unrearranged human Vλ gene segments at an immunoglobulin heavy chain locus (e.g., hybrid immunoglobulin chain locus). In some embodiments, the non-human animal comprises 1, 2, 3, 4 or more unrearranged human Jλ gene segments, e.g., Jλ1, Jλ2, Jλ3, Jλ7, etc., at an immunoglobulin heavy chain locus (e.g., hybrid immunoglobulin chain locus). In some embodiments, the non-human animal comprises two unrearranged human Vκ gene segments at an immunoglobulin light chain locus. In some embodiments, the non-human animal comprises two unrearranged human Vλ gene segments at an immunoglobulin light chain locus.

In some embodiments, genetically modified mice comprising in their genomes (i) an immunoglobulin heavy chain locus (e.g., hybrid immunoglobulin chain locus) that comprises unrearranged human V_(L) and J_(L) gene segments operably linked to a human or non-human heavy chain constant region nucleic acid sequence, and (ii) an immunoglobulin light chain locus comprising rearranged human light chain variable region nucleic acid sequence operably linked to a light chain constant region nucleic acid sequence, demonstrate CD19⁺ B cell numbers and mature B cell numbers that are substantially the same as the numbers observed in wild type mice or mice containing other modifications of their immunoglobulin loci (i.e., genetically modified control mice; e.g., VELOCIMMUNE® humanized mice, in which the humoral immune system of the mouse functions like that of a wild type mouse). In some embodiments, such mice also demonstrate a functional silencing of endogenous lambda light chains in splenic B cells. In some embodiments, the mice exhibit normal or nearly normal B cell development in the bone marrow and the spleen. In some embodiments, such mice exhibit a lack of detectable expression and/or usage (or functional silencing) of lambda light chains compared to genetically modified control mice.

In another aspect, a non-human animal is provided comprising (a) a genetically modified immunoglobulin heavy chain locus comprising: a first nucleotide sequence that encodes a light chain variable domain (e.g., where the first nucleotide sequence contains unrearranged human immunoglobulin light chain variable region gene segments), wherein the first nucleotide sequence is operably linked to a heavy chain constant region gene sequence comprising one or more heavy chain constant region genes each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising at least an intact Igμ gene and at least one of an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and an intact Igα gene (thus, resulting in, e.g., a hybrid immunoglobulin chain locus); and (b) genetically modified immunoglobulin light chain locus comprising a second nucleotide sequence that encodes a human light chain variable domain (e.g., where the second nucleotide sequence is a rearranged human immunoglobulin light chain variable region nucleotide sequence or where the second nucleotide sequence contains a limited number of human V_(L) gene segments; e.g., two or more but less than the wild type number of human V_(L) gene segments), wherein the second nucleotide sequence is operably linked to a light chain constant region gene sequence. For example, in some embodiments, a rearranged light chain from a pre-designed VJ region (i.e., a rearranged human immunoglobulin light chain variable region nucleotide sequence; i.e., a common or universal light chain sequence) or a limited number of human V_(L) gene segments (e.g., two or more but less than the wild type number of human V_(L) gene segments) can be operably linked to a light chain constant region gene sequence by targeting the rearranged light chain sequence into a mouse light chain locus, either κ or λ. Thus, as in other embodiments, this genetically engineered immunoglobulin light chain locus may be present in the germ line genome of the non-human animal. Genetically modified non-human animals comprising unrearranged human immunoglobulin light chain variable region nucleotide sequences in operable linkage with a heavy chain constant region gene sequences are described in U.S. Patent Application Publication No. 2012-0096572 A1, which is incorporated herein by reference. In some embodiments, the second nucleotide sequence that encodes the human light chain variable domain is operably linked to a κ light chain constant (i.e., Cκ) region gene sequence. In some embodiments, the second nucleotide sequence that encodes the human light chain variable domain is operably linked to a mouse or rat Cκ region gene sequence. In some embodiments, the second nucleotide sequence that encodes the light chain variable domain is operably linked to a human Cκ region gene sequence. In some embodiments, the second nucleotide sequence that encodes the human light chain variable domain is operably linked to a Cλ region gene sequence. In some embodiments, the second nucleotide sequence that encodes the human light chain variable domain is operably linked to a mouse or rat Cλ region gene sequence. In some embodiments, the second nucleotide sequence that encodes the human chain variable domain is operably linked to a human region gene sequence.

In some embodiments, the non-human animal is a mammal. Although embodiments employing a rearranged human light chain variable region (or a limited number of human V_(L) gene segments) and unrearranged human light chain variable region gene segments in a mouse (i.e., a mouse with an immunoglobulin light locus comprising a rearranged human immunoglobulin light chain variable region nucleotide sequence (or a limited number of human V_(L) gene segments) and an immunoglobulin heavy chain locus (e.g., hybrid immunoglobulin chain locus) comprising unrearranged human light chain variable region gene segments) are extensively discussed herein, other non-human animals that comprise a genetically modified immunoglobulin heavy and light chain loci as described herein are also provided. Such non-human animals include any of those which can be genetically modified to express the rearranged human immunoglobulin light chain variable region nucleotide sequence (or a human light chain variable domain from the limited number of human V_(L) gene segments) as disclosed herein, including, e.g., mammals, e.g., mouse, rat, rabbit, pig, bovine (e.g., cow, bull, buffalo), deer, sheep, goat, chicken, cat, dog, ferret, primate (e.g., marmoset, rhesus monkey), etc. For example, for those non-human animals for which suitable genetically modifiable ES cells are not readily available, other methods are employed to make a non-human animal comprising the genetic modification. Such methods include, e.g., modifying a non-ES cell genome (e.g., a fibroblast or an induced pluripotent cell) and employing somatic cell nuclear transfer (SCNT) to transfer the genetically modified genome to a suitable cell, e.g., an enucleated oocyte, and gestating the modified cell (e.g., the modified oocyte) in a non-human animal under suitable conditions to form an embryo. Methods for modifying a non-human animal genome (e.g., a pig, cow, rodent, chicken, etc. genome) include, e.g., employing a zinc finger nuclease (ZFN) or a transcription activator-like effector nuclease (TALEN) to modify a genome to include an immunoglobulin light chain locus that contains a rearranged human immunoglobulin light chain variable region nucleotide sequence (or a limited number of human V_(L) gene segments, e.g., two or more but less than the wild type number of human V_(L) gene segments) and an immunoglobulin heavy chain locus (e.g., hybrid immunoglobulin chain locus) that contains unrearranged human light chain variable region gene segments.

In some embodiments, the non-human animal is a small mammal, e.g., of the superfamily Dipodoidea or Muroidea. In some embodiments, the genetically modified animal is a rodent. In some embodiments, the rodent is selected from a mouse, a rat, and a hamster. In some embodiments, the rodent is selected from the superfamily Muroidea. In some embodiments, the genetically modified animal is from a family selected from Calomyscidae (e.g., mouse-like hamsters), Cricetidae (e.g., hamster, New World rats and mice, voles), Muridae (true mice and rats, gerbils, spiny mice, crested rats), Nesomyidae (climbing mice, rock mice, white-tailed rats, Malagasy rats and mice), Platacanthomyidae (e.g., spiny dormice), and Spalacidae (e.g., mole rates, bamboo rats, and zokors). In a specific embodiment, the genetically modified rodent is selected from a true mouse or rat (family Muridae), a gerbil, a spiny mouse, and a crested rat. In some embodiments, the genetically modified mouse is from a member of the family Muridae. In some embodiments, the animal is a rodent. In specific embodiments, the rodent is selected from a mouse and a rat. In some embodiments, the non-human animal is a mouse.

In some embodiments, the non-human animal is a rodent that is a mouse of a C57BL strain selected from C57BL/A, C57BL/An, C57BL/GrFa, C57BL/KaLwN, C57BL/6, C57BL/6J, C57BL/6ByJ, C57BL/6N, C57BL/6NJ, C57BL/10, C57BL/10ScSn, C57BL/10Cr, and C57BL/Ola. In another embodiment, the mouse is a 129 strain. In some embodiments, the 129 strain is selected from the group consisting of 129P1, 129P2, 129P3, 129X1, 129S1 (e.g., 129S1/SV, 129S1/SvIm), 129S2, 129S4, 129S5, 129S9/SvEvH, 129S6 (129/SvEvTac), 129S7, 129S8, 129T1, 129T2 (see, e.g., Festing et al. (1999) Revised nomenclature for strain 129 mice, Mammalian Genome 10:836, see also, Auerbach et al. (2000) Establishment and Chimera Analysis of 129/SvEv- and C57BL/6-Derived Mouse Embryonic Stem Cell Lines). In some embodiments, the genetically modified mouse is a mix of an aforementioned 129 strain and an aforementioned C57BL strain (e.g., a C57BL/6 strain). In another embodiment, the mouse is a mix of aforementioned 129 strains, or a mix of aforementioned C57BL/6 strains. In some embodiments, the 129 strain of the mix is a 129S6 (129/SvEvTac) strain. In another embodiment, the mouse is a mix of a 129/SvEv- and a C57BL/6-derived strain. In a specific embodiment, the mouse is a mix of a 129/SvEv- and a C57BL/6-derived strain as described in Auerbach et al. 2000 BioTechniques 29:1024-1032. In another embodiment, the mouse is a BALB strain, e.g., BALB/c strain. In another embodiment, the mouse is a mix of a BALB strain (e.g., BALB/c strain) and another aforementioned strain.

In some embodiments, the non-human animal is a rat. In some embodiments, the rat is selected from a Wistar rat, an LEA strain, a Sprague Dawley strain, a Fischer strain, F344, F6, ACI, and Dark Agouti (DA). In some embodiments, the rat strain is a mix of two or more of a strain selected from the group consisting of Wistar, LEA, Sprague Dawley, Fischer, F344, F6, ACI and Dark Agouti (DA).

In some embodiments, such a genetically modified mouse uses lambda gene sequences with a frequency that is half or less than half of the frequency that lambda gene sequences are used in wild type.

In various embodiments, as described herein, the rearranged light chain variable domain is derived from a human V_(L) and J_(L) gene sequence or segment. In other embodiments, the rearranged light chain variable domain is derived from a non-human V_(T), and J_(L) gene sequence or segment. In some embodiments, the rearranged light chain variable domain is derived from a human germ line V_(L) segment and a human germ line J_(L) segment. In some embodiments, the human V_(L) segment corresponds to observed variants in the human population.

In various embodiments, as described herein, the human V_(L) gene segment of the rearranged light chain variable region nucleotide sequence is a human Vκ gene segment. In some embodiments, the human Vκ gene segment is selected from the group consisting of Vκ4-1, Vκ5-2, Vκ7-3, Vκ2-4, Vκ1-5, Vκ1-6 Vκ3-7, Vκ1-8, Vκ1-9, Vκ2-10, Vκ3-11, Vκ1-12, Vκ1-13, Vκ2-14, Vκ3-15, Vκ1-16, Vκ1-17, Vκ2-18, Vκ2-19, Vκ3-20, Vκ6-21, Vκ1-22, Vκ1-23, Vκ2-24, Vκ3-25, Vκ2-26, Vκ1-27, Vκ2-28, Vκ2-29, Vκ2-30, Vκ3-31, Vκ1-32, Vκ1-33, Vκ3-34, Vκ1-35, Vκ2-36, Vκ1-37, Vκ2-38, Vκ1-39, Vκ2-40, and a polymorphic variant thereof. In some embodiments, the human Vκ segment is Vκ1-39 or polymorphic variant thereof. In some embodiments, the human Vκ gene segment is Vκ3-20.

In various embodiments, as described herein, the human V_(L) gene segments of the restricted (limited) immunoglobulin light chain variable region gene segment repertoire (e.g., a restricted immunoglobulin light chain variable segment repertoire comprising two or more but less than the wild type number of human V_(L) gene segments) are human Vκ gene segments. In some embodiments, the human Vκ gene segments are selected from human Vκ gene segments described herein. In some certain embodiments, the human Vκ gene segments of the restricted (limited) immunoglobulin light chain variable region gene segment repertoire include a human Vκ1-39 gene segment and a human Vκ3-20 gene segment. In various embodiments of the restricted (limited) immunoglobulin light chain variable gene segment non-human animal, the restricted light chain variable gene segments (e.g., a human Vκ1-39 gene segment and a human Vκ3-20 gene segment) are operably linked to one, two, three, four, or more human J_(L) gene segments; such that the restricted immunoglobulin light chain variable gene segments recombine with one of the one or two or three or four or more human J_(L) gene segments (i.e., Jκ gene segments) to form a rearranged VκJκ light chain variable gene.

In various embodiments, as described herein, the human V_(L) gene segment of the rearranged light chain variable region nucleotide sequence is a human Vλ gene segment. In some embodiments, the human Vλ gene segment is selected from the group consisting of Vλ3-1, Vλ4-3, Vλ2-8, Vλ3-9, Vλ3-10, Vλ2-11, Vλ3-12, Vλ2-14, Vλ3-16, Vλ2-18, Vλ3-21, Vλ3-22, Vλ2-23, Vλ3-25, Vλ3-27, Vλ1-36, Vλ5-37, Vλ5-39, Vλ7-43, Vλ1-44, VAS-45, Vλ1-47, Vλ9-49, Vλ1-51, Vλ5-52, Vλ6-57, Vλ4-60, Vλ8-61, Vλ4-69, and a polymorphic variant thereof. In some embodiments, the human Vλ segment is Vλ2-14.

In various embodiments, as described herein, the human V_(L) gene segments of the restricted (limited) immunoglobulin light chain variable region gene segment repertoire (e.g., a restricted immunoglobulin light chain variable segment repertoire comprising two or more but less than the wild type number of human V_(L) gene segments) are human Vλ gene segments. In some embodiments, the human Vλ gene segments are selected from human Vλ gene segments described herein. In some certain embodiments, the human Vλ gene segments of the restricted (limited) immunoglobulin light chain variable region gene segment repertoire include a human Vλ2-14 gene segment.

In various embodiments, as described herein, the human J_(L) gene segment of the rearranged light chain variable region nucleotide sequence is selected from the group consisting of Jκ1, Jκ2, Jκ3, Jκ4, Jκ5, Jλ1, Jλ2, Jλ3, Jλ7, and a polymorphic variant thereof.

In various embodiments, as described herein, the human J_(L) gene segments of the restricted (limited) immunoglobulin light chain variable region gene segment repertoire include human Jκ1, Jκ2, Jκ3, Jκ4, Jκ5, and a polymorphic variant thereof. In various embodiments, as described herein, the human J_(L) gene segments of the restricted (limited) immunoglobulin light chain variable region gene segment repertoire include human Jλ1, Jλ2, Jλ3, Jλ7, and a polymorphic variant thereof.

In some embodiments, the human or non-human animal light chain constant region sequence comprises a sequence selected from a Cκ and a Cλ region.

Various embodiments utilize or encompass features or sequence information derived from VELOCIMMUNE® humanized mice. VELOCIMMUNE® humanized mice contain a precise, large-scale replacement of germ line variable regions of mouse immunoglobulin heavy chain (IgH) and immunoglobulin light chain (e.g., κ light chain, Igκ) with corresponding human immunoglobulin variable regions, at the endogenous loci (see, e.g., U.S. Pat. Nos. 6,596,541 and 8,502,018, the entire contents of which are incorporated herein by reference). In total, about six megabases of mouse loci are replaced with about 1.5 megabases of human genomic sequence. This precise replacement results in a mouse with hybrid immunoglobulin loci that make heavy and light chains that have a human variable regions and a mouse constant region. The precise replacement of mouse V_(H)-D-J_(H) and Vκ-Jκ segments leave flanking mouse sequences intact and functional at the hybrid immunoglobulin loci. The humoral immune system of the mouse functions like that of a wild type mouse. B cell development is unhindered in any significant respect and a rich diversity of human variable regions is generated in the mouse upon antigen challenge. Moreover, VELOCIMMUNE® humanized mice display an essentially normal, wild-type response to immunization that differs only in one significant respect from wild type mice the variable regions generated in response to immunization are fully human. VELOCIMMUNE® humanized mice are possible because immunoglobulin gene segments for heavy and κ light chains rearrange similarly in humans and mice. Although the loci are not identical, they are similar enough that humanization of the heavy chain variable gene locus can be accomplished by replacing about three million base pairs of contiguous mouse sequence that contains all the V_(H), D, and J_(H) gene segments with about one million bases of contiguous human genomic sequence covering basically the equivalent sequence from a human immunoglobulin locus.

In particular embodiments, a humanized mouse comprising an immunoglobulin heavy chain locus that contains unrearranged human light chain variable region gene segments (i.e., comprising an immunoglobulin heavy chain locus that comprises unrearranged human immunoglobulin V_(L) and J_(L) gene segments) is provided. A humanized mouse so modified comprises a replacement of mouse immunoglobulin heavy chain variable region gene segments with unrearranged human immunoglobulin light chain variable region gene segments (i.e., unrearranged V_(L) and J_(L) gene at an endogenous heavy chain locus), and a replacement of mouse immunoglobulin light chain variable gene segments with a rearranged human V_(L)J_(L) nucleotide sequence or a replacement of mouse immunoglobulin light chain variable gene segments with a restricted (limited) immunoglobulin light chain variable region gene segment repertoire (e.g., two or more but less than the wild type number of human V_(L) gene segments).

In some embodiments, the mouse so modified comprises a replacement of mouse immunoglobulin heavy chain variable region gene segments with at least 40 unrearranged human Vκ gene segments and five unrearranged human Jκ gene segments. In some embodiments, the unrearranged human Vκ gene segments are selected from the group consisting of Vκ1-5, Vκ1-6, Vκ1-8, Vκ1-9, Vκ1-12, Vκ1-13, Vκ1-16, Vκ1-17, Vκ1-22, Vκ1-27, Vκ1-32, Vκ1-33, Vκ1-35, Vκ1-37, Vκ1-39, Vκ1 D-8, Vκ1D-12, Vκ1D-16, Vκ1D-17, Vκ1D-22, Vκ1D-27, Vκ1D-32, Vκ1D-33, Vκ1D-35, Vκ1D-37, Vκ1D-39, Vκ1D-42, Vκ1D-43, Vκ1-NL1, Vκ2-4, Vκ2-10, Vκ2-14, Vκ2-18, Vκ2-19, Vκ2-23, Vκ2-24, Vκ2-26, Vκ2-28, Vκ2-29, Vκ2-30, Vκ2-36, Vκ2-38, Vκ2-40, Vκ2D-10, Vκ2D-14, Vκ2D-18, Vκ2D-19, Vκ2D-23, Vκ2D-24, Vκ2D-26, Vκ2D-28, Vκ2D-29, Vκ2D-30, Vκ2D-36, Vκ2D-38, Vκ2D-40, Vκ3-7, Vκ3-11, Vκ3-15, Vκ3-20, Vκ3-25, Vκ3-31, Vκ3-34, Vκ3D-7, Vκ3D-7, Vκ3D-11, Vκ3D-15, Vκ3D-15, Vκ3D-20, Vκ3D-25, Vκ3D-31, Vκ3D-34, Vκ3-NL1, Vκ3-NL2, Vκ3-NL3, Vκ3-NL4, Vκ3-NL5, Vκ4-1, Vκ6-21, Vκ6D-21, Vκ6D-41, and Vκ7-3. In some embodiments, the human Vκ gene segments comprise Vκ4-1, Vκ5-2, Vκ7-3, Vκ2-4, Vκ1-5, and Vκ1-6. In one embodiment, the Vκ gene segments comprise Vκ3-7, Vκ1-8, Vκ1-9, Vκ2-10, Vκ3-11, Vκ1-12, Vκ1-13, Vκ2-14, Vii-15 and Vκ1-16. In some embodiments, the human Vκ gene segments comprise Vκ1-17, Vκ2-18, Vκ2-19, Vκ3-20, Vκ6-21, Vκ1-22, Vκ1-23, Vκ2-24, Vκ3-25, Vκ2-26, Vκ1-27, Vκ2-28, Vκ2-29, and Vκ2-30. In some embodiments, the human Vκ gene segments comprise Vλ3-31, Vκ1-32, Vκ1-33, Vκ3-34, Vκ1-35, Vκ2-36, Vκ1-37, Vκ2-38, Vκ1-39, and Vκ2-40. In specific embodiments, the Vκ gene segments comprise contiguous human immunoglobulin κ gene segments spanning the human immunoglobulin κ light chain locus from Vκ4-1 through Vκ2-40, and the Jκ gene segments comprise contiguous gene segments spanning the human immunoglobulin κ light chain locus from Jκ1 through Jκ5. In some embodiments, the rearranged human light chain variable region nucleotide sequence (i.e., rearranged human VκJκ nucleotide sequence) is operably linked to a mouse light chain constant region sequence (e.g., a Cκ sequence). A humanized mouse comprising an immunoglobulin heavy chain locus encoding human light chain variable domains (i.e., comprising an immunoglobulin heavy chain locus that comprises unrearranged human immunoglobulin light chain variable region gene segments) can be used in any of the aspects, embodiments, methods, etc. described herein.

In some embodiments, the mouse so modified comprises a replacement of mouse immunoglobulin heavy chain variable region gene segments with at least 40 unrearranged human Vλ gene segments and one or more unrearranged human Jλ gene segments; in some certain embodiments, at least 40 unrearranged human Vλ gene segments and four unrearranged human Jλ gene segments. In some embodiments, the unrearranged human Vκ gene segments are selected from the group consisting of Vλ3-1, Vλ4-3, Vλ2-8, Vλ3-9, Vλ3-10, Vλ2-11, Vλ3-12, Vλ2-14, Vλ3-16, Vλ2-18, Vλ3-19, Vλ3-21, Vλ3-22, Vλ2-23, Vλ3-25, Vλ3-27, Vλ1-36, Vλ5-37, Vλ5-39, Vλ1-40, Vλ7-43, Vλ5-45, Vλ7-46, Vλ9-49, Vλ1-51, Vλ5-52, Vλ6-57, Vλ4-60, Vλ8-61, Vλ4-69, and a polymorphic variant thereof. In some embodiments, the unrearranged human Vλ gene segments include Vλ3-1, Vλ2-8, Vλ3-9, Vλ3-10, Vλ2-11 and Vλ3-12. In some embodiments, the unrearranged human Vλ gene segments include V Vλ2-14, Vλ3-16, Vλ2-18, Vλ3-19, Vλ3-21, Vλ3-22, Vλ2-23, Vλ3-25 and Vλ3-27. In some embodiments, the unrearranged human Vλ gene segments include V Vλ1-36, Vλ5-37, Vλ5-39, Vλ1-40, Vλ7-43, Vλ1-44, Vλ5-45, Vλ7-46, Vλ1-47, Vλ9-49, Vλ1-51, Vλ5-52, Vλ6-57, Vλ4-60, Vλ8-61 and Vλ4-69. In specific embodiments, the Vλ gene segments comprise contiguous human immunoglobulin λ gene segments spanning the human immunoglobulin λ light chain locus from Vλ3-1 through Vλ3-12, and the Jλ gene segments include Jλ1. In specific embodiments, the Vλ gene segments comprise contiguous human immunoglobulin λ gene segments spanning the human immunoglobulin λ light chain locus from Vλ3-12 through Jλ1. In specific embodiments, the Vλ gene segments comprise contiguous human immunoglobulin λ gene segments spanning the human immunoglobulin λ light chain locus from Vλ3-1 through Vλ3-12, and the Jλ gene segments include Jλ1, Jλ2, Jλ3 and Jλ7. In specific embodiments, the Vλ gene segments comprise contiguous human immunoglobulin λ gene segments spanning the human immunoglobulin λ light chain locus from Vλ3-12 through Vλ3-27, and the Jλ gene segments include Jλ1 or Jλ1, Jλ2, Jλ3 and Jλ7. In specific embodiments, the Vλ gene segments comprise contiguous human immunoglobulin λ gene segments spanning the human immunoglobulin λ light chain locus from Vλ1-40 through Vλ5-52, and the Jλ gene segments include Jλ1 or Jλ1, Jλ2, Jλ3 and Jλ7. In some embodiments, the rearranged human light chain variable region nucleotide sequence is a rearranged human VλJλ nucleotide sequence and is operably linked to a mouse light chain constant region sequence (e.g., a Cλ sequence). A humanized mouse comprising an immunoglobulin heavy chain locus encoding human light chain variable domains (i.e., comprising an immunoglobulin heavy chain locus that comprises unrearranged human immunoglobulin light chain variable region gene segments) can be used in any of the aspects, embodiments, methods, etc. described herein.

In various embodiments, the unrearranged human immunoglobulin light chain variable region gene segments are operably linked to a human or mouse heavy chain constant region gene sequence (e.g., a heavy chain constant region gene sequence that encodes an immunoglobulin isotype selected from IgM, IgD, IgA, IgE, IgG, and combinations thereof, wherein each heavy chain constant region gene encodes a functional C_(H)1 domain). For example, genetically modified non-human animals are provided comprising (a) an immunoglobulin heavy chain locus (e.g., hybrid immunoglobulin chain locus) that contains a first nucleotide sequence which contains unrearranged human light chain variable region gene segments (i.e., where the first nucleotide sequence comprises at least 40 human VIC gene segments and 5 human Jκ gene segments), wherein the first nucleotide sequence is operably linked to a human or non-human heavy chain constant region gene sequence; and (b) an immunoglobulin light chain locus that contains a second nucleotide sequence that encodes a light chain variable domain (i.e., where the second nucleotide sequence is a rearranged human immunoglobulin light chain variable region nucleotide sequence or where the second nucleotide sequence contains a limited number of human V_(L) gene segments; e.g., two or more but less than the wild type number of human V_(L) gene segments), wherein the second nucleotide sequence is operably linked to a human or non-human light chain constant region gene sequence comprising one or more heavy chain constant region genes each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising at least an intact Igμ gene and at least one of an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and an intact Igα gene. In some embodiments, the human heavy chain constant region gene further encodes a hinge, a C_(H)2, a C_(H)3, and combinations thereof. In some embodiments, a mouse heavy chain constant region gene further encoedes a hinge, a C_(H)2, a C_(H)3, and combinations thereof. In some embodiments, further replacement of certain non-human animal constant region gene sequences with human gene sequences (e.g., replacement of mouse C_(H)1 sequence with human C_(H)1 sequence, and replacement of mouse C_(T), sequence with human C_(r), sequence) results in genetically modified non-human animals with chimeric (and hybrid) immunoglobulin loci that make antibodies that have human variable regions and partly human constant regions, suitable for, e.g., making fully human antibody fragments, e.g., fully human Fab's. In some embodiments, the unrearranged human light chain variable region gene segments are operably linked to a rat heavy chain constant region gene sequence comprising one or more heavy chain constant region genes each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising at least an intact Igμ gene and at least one of an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and an intact Igα gene. In some embodiments, the rat heavy chain constant region gene further encodes a C_(H)2, a C_(H)3, and combinations thereof. In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence (or a limited number of human V_(L) gene segments) is operably linked with a human Cκ region sequence. In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence (or a limited number of human V_(L) gene segments) is operably linked with a mouse or rat Cκ region sequence. In various embodiments, the genetically modified immunoglobulin light chain locus of the non-human animal comprises two copies, three copies, four copies or more of the rearranged human immunoglobulin light chain variable region nucleotide sequence operably linked to a light chain constant region gene sequence. In particular embodiments, the immunoglobulin light chain locus comprises a plurality of copies of the rearranged human immunoglobulin light chain variable region nucleotide sequence operably linked to a light chain constant region gene sequence.

In various embodiments, a (human) IgG1, IgG2, or IgG4 heavy chain constant region gene (e.g., cloned in an expression vector, at an endogenous locus) etc., comprises one or more modification(s) in a C_(H)3 encoding sequence of the gene, wherein the modification reduces or eliminates affinity of the C_(H)3 domain encoded by the modified encoding sequence to Protein A (see, e.g., U.S. Pat. No. 8,586,713, incorporated herein in its entirety by reference). Such modification includes, but is not limited to a mutation selected from the group consisting of (a) 95R, and (b) 95R and 96F in the IMGT numbering system, or (a′) 435R, and (b′) 435R and 436F in the EU numbering system. In some embodiments, the (human and) mutated heavy chain constant region is a (human and) mutated IgG1 constant region and, in addition to the (a) 95R or (b) 95R and 96F mutation (in the IMGT numbering system), further comprises one to five modifications selected from the group consisting of 16E, 18M, 44S, 52N, 57M, and 82I in the IMGT exon numbering system, or 356E, 358M, 384S, 392N, 397M, and 422I in the EU numbering system. In some embodiments, the heavy chain constant gene is a (human) IgG2 constant gene and, in addition to the (a) 95R or (b) 95R and 96F mutation (in the IMGT numbering system), further comprises one or two modifications selected from the group consisting of 44S, 52N, 82I in the IMGT exon numbering system, or 348S, 392N and 422I in the EU numbering system. In other embodiments, the (human) heavy chain constant gene is a (human) IgG4 constant gene and, in addition to the (a) 95R or (b) 95R and 96F mutation (in the IMGT numbering system), further comprises one to seven modifications selected from the group consisting of 15R, 44S, 52N, 57M, 69K, 79Q and 82I in the IMGT exon numbering system or 355R, 384S, 392N, 397M, 409K, 419Q and 422I in the EU numbering system and/or the modification 105P in the IGMT exon numbering system or 445P in the EU numbering system.

In various embodiments, the heavy chain constant region nucleotide sequence comprises a modification in a C_(H)2 or a C_(H)3, wherein the modification increases the affinity of the heavy chain constant region amino acid sequence to FcRn in an acidic environment (e.g., in an endosome where pH ranges from about 5.5 to about 6.0). In some embodiments, the heavy chain constant region nucleotide sequence encodes a human heavy chain constant region amino acid sequence comprising a modification at position 250 by EU numbering (263 by Kabat numbering) (e.g., E or Q); 250 by EU numbering (263 by Kabat numbering) and 428 by EU numbering (459 by Kabat numbering) (e.g., L or F); 252 by EU numbering (265 by Kabat numbering) (e.g., L/Y/F/W or T), 254 by EU numbering (267 by Kabat numbering) (e.g., S or T), and 256 by EU numbering (269 by Kabat numbering) (e.g., S/R/Q/E/D or T); or a modification at position 428 by EU numbering (459 by Kabat numbering) and/or 433 by EU numbering (464 by Kabat numbering) (e.g., L/R/S/P/Q or K) and/or 434 by EU numbering (465 by Kabat numbering) (e.g., H/F or Y); or a modification at position 250 by EU numbering (263 by Kabat numbering) and/or 428 by EU numbering (459 by Kabat numbering); or a modification at position 307 by EU numbering (326 by Kabat numbering) or 308 by EU numbering (327 by Kabat numbering) (e.g., 308F, V308F), and 434 by EU numbering (465 by Kabat numbering). In one embodiment, the modification comprises a 428L (e.g., M428L) and 434S (e.g., N434S) modification by EU numbering (a 459, e.g., M459L, and 465S (e.g., N465S) modification by Kabat numbering); a 428L, 259I (e.g., V259I), and 308F (e.g., V308F) modification by EU numbering (a 459L, 272I (e.g., V272I), and 327F (e.g., V327F) modification by Kabat numbering; a 433K (e.g., H433K) and a 434 (e.g., 434Y) modification by EU numbering (a 464K (e.g., H464K) and a 465 (e.g., 465Y) modification by Kabat numbering; a 252, 254, and 256 (e.g., 252Y, 254T, and 256E) modification by EU numbering (a 265, 267, 269 (e.g., 265Y, 267T, and 269E) modification by Kabat numbering; a 250Q and 428L modification (e.g., T250Q and M428L) by EU numbering (a 263Q and 459L modification, e.g., T263Q and M459L, by Kabat numbering); and a 307 and/or 308 modification (e.g., 307F or 308P) by EU numbering (326 and/or 327 modification, e.g., 326F or 308P, by Kabat numbering), wherein the modification increases the affinity of the heavy chain constant region amino acid sequence to FcRn in an acidic environment (e.g., in an endosome where pH ranges from about 5.5 to about 6.0). In some embodiments, the heavy chain constant region nucleotide sequence encodes a human C₁₁2 amino acid sequence comprising at least one modification between amino acid residues at positions 252 and 257 by EU numbering (i.e., at least one modification between amino acid positions 265 and 270 by Kabat numbering), wherein the modification increases the affinity of the human C_(H)2 amino acid sequence to FcRn in an acidic environment (e.g., in an endosome where pH ranges from about 5.5 to about 6.0). In some embodiments, the heavy chain constant region nucleotide sequence encodes a human C_(H)2 amino acid sequence comprising at least one modification between amino acid residues at positions 307 and 311 (i.e., at least one modification between amino acid positions 326 and 330 by Kabat numbering), wherein the modification increases the affinity of the C_(H)2 amino acid sequence to FcRn in an acidic environment (e.g., in an endosome where pH ranges from about 5.5 to about 6.0). In some embodiments, the heavy chain constant region nucleotide sequence encodes a human C_(H)3 amino acid sequence, wherein the C_(H)3 amino acid sequence comprises at least one modification between amino acid residues at positions 433 and 436 by EU numbering (i.e., at least one modification between amino acid residues at positions 464 and 467 by Kabat numbering), wherein the modification increases the affinity of the C_(H)3 amino acid sequence to FcRn in an acidic environment (e.g., in an endosome where pH ranges from about 5.5 to about 6.0). In some embodiments, the heavy chain constant region nucleotide sequence encodes a human heavy chain constant region amino acid sequence comprising a mutation selected from the group consisting of M428L by EU numbering (459 by Kabat numbering), N434S by EU numbering (465 by Kabat numbering), and a combination thereof. In some embodiments, the heavy chain constant region nucleotide sequence encodes a human heavy chain constant region amino acid sequence comprising a mutation selected from the group consisting of M428L by EU numbering (M459L by Kabat numbering), V259I by EU numbering (V272I by Kabat numbering), V308F by EU numbering (V327 by Kabat numbering), and a combination thereof. In some embodiments, the heavy chain constant region nucleotide sequence encodes a human heavy chain constant region amino acid sequence comprising an N434A mutation by EU numbering (an N465A mutation by Kabat numbering). In some embodiments, the heavy chain constant region nucleotide sequence encodes a human heavy chain constant region amino acid sequence comprising a mutation selected from the group consisting of M252Y by EU numbering (M265Y by Kabat numbering), S254T by EU numbering (S267T by Kabat numbering), T256E by EU numbering (T269E by Kabat numbering), and a combination thereof. In some embodiments, the heavy chain constant region nucleotide sequence encodes a human heavy chain constant region amino acid sequence comprising a mutation selected from the group consisting of T250Q by EU numbering (T263Q by Kabat numbering), M428L by EU numbering (M459L by Kabat numbering), and a combination thereof. In some embodiments, the heavy chain constant region nucleotide sequence encodes a human heavy chain constant region amino acid sequence comprising a mutation selected from the group consisting of H433K by EU numbering (H464K by Kabat numbering), N434Y by EU numbering (N465Y by Kabat numbering), and a combination thereof.

In various embodiments, a non-human animal as described herein is immunized with an antigen of interest, and a B cell expressing an antigen-binding protein that specifically binds the antigen of interest is identified, and a nucleic acid sequence of the B cell which encodes a light chain variable domain in a polypeptide comprising a heavy chain constant region is identified and determined. The nucleic acid sequence of the light chain variable domain is expressed, in a suitable cell and employing a suitable expression vector, with a heavy chain constant nucleic acid sequence comprising one, two, three, or more modifications. In some embodiments, the light chain variable region is human, and the heavy chain sequence is human. In some embodiments, the heavy chain constant region nucleotide sequence comprises a modification in a C_(H)2 or a C_(H)3, wherein the modification increases the affinity of the heavy chain constant region amino acid sequence to FcRn in an acidic environment (e.g., in an endosome where pH ranges from about 5.5 to about 6.0). In some embodiments, the heavy chain constant region nucleotide sequence encodes a human heavy chain constant region amino acid sequence comprising a modification at position 250 by EU numbering (263 by Kabat numbering) (e.g., E or Q); 250 by EU numbering (263 by Kabat numbering) and 428 by EU numbering (459 by Kabat numbering) (e.g., L or F); 252 by EU numbering (265 by Kabat numbering) (e.g., L/Y/F/W or T), 254 by EU numbering (267 by Kabat numbering) (e.g., S or T), and 256 by EU numbering (269 by Kabat numbering) (e.g., S/R/Q/E/D or T); or a modification at position 428 by EU numbering (459 by Kabat numbering) and/or 433 by EU numbering (464 by Kabat numbering) (e.g., L/R/S/P/Q or K) and/or 434 by EU numbering (465 by Kabat numbering) (e.g., H/F or Y); or a modification at position 250 by EU numbering (263 by Kabat numbering) and/or 428 by EU numbering (459 by Kahat numbering); or a modification at position 307 by EU numbering (326 by Kahat numbering) or 308 by EU numbering (327 by Kabat numbering) (e.g., 308F, V308F), and 434 by EU numbering (465 by Kabat numbering). In one embodiment, the modification comprises a 428L (e.g., M428L) and 434S (e.g., N434S) modification by EU numbering (a 459, e.g., M459L, and 465S (e.g., N465S) modification by Kabat numbering); a 428L, 259I (e.g., V259I), and 308F (e.g., V308F) modification by EU numbering (a 459L, 272I (e.g., V272I), and 327F (e.g., V327F) modification by Kabat numbering; a 433K (e.g., H433K) and a 434 (e.g., 434Y) modification by EU numbering (a 464K (e.g., H464K) and a 465 (e.g., 465Y) modification by Kabat numbering; a 252, 254, and 256 (e.g., 252Y, 254T, and 256E) modification by EU numbering (a 265, 267, 269 (e.g., 265Y, 267T, and 269E) modification by Kabat numbering; a 250Q and 428L modification (e.g., T250Q and M428L) by EU numbering (a 263Q and 459L modification, e.g., T263Q and M459L, by Kabat numbering); and a 307 and/or 308 modification (e.g., 307F or 308P) by EU numbering (326 and/or 327 modification, e.g., 326F or 308P, by Kabat numbering), wherein the modification increases the affinity of the heavy chain constant region amino acid sequence to FcRn in an acidic environment (e.g., in an endosome where pH ranges from about 5.5 to about 6.0). In some embodiments, the heavy chain constant region nucleotide sequence encodes a human C₁₁2 amino acid sequence comprising at least one modification between amino acid residues at positions 252 and 257 by EU numbering (i.e., at least one modification between amino acid positions 265 and 270 by Kabat numbering), wherein the modification increases the affinity of the human C_(H)2 amino acid sequence to FcRn in an acidic environment (e.g., in an endosome where pH ranges from about 5.5 to about 6.0). In some embodiments, the heavy chain constant region nucleotide sequence encodes a human C_(H)2 amino acid sequence comprising at least one modification between amino acid residues at positions 307 and 311 (i.e., at least one modification between amino acid positions 326 and 330 by Kabat numbering), wherein the modification increases the affinity of the C_(H)2 amino acid sequence to FcRn in an acidic environment (e.g., in an endosome where pH ranges from about 5.5 to about 6.0). In some embodiments, the heavy chain constant region nucleotide sequence encodes a human C_(H)3 amino acid sequence, wherein the C_(H)3 amino acid sequence comprises at least one modification between amino acid residues at positions 433 and 436 by EU numbering (i.e., at least one modification between amino acid residues at positions 464 and 467 by Kabat numbering), wherein the modification increases the affinity of the C_(H)3 amino acid sequence to FcRn in an acidic environment (e.g., in an endosome where pH ranges from about 5.5 to about 6.0). In some embodiments, the heavy chain constant region nucleotide sequence encodes a human heavy chain constant region amino acid sequence comprising a mutation selected from the group consisting of M428L by EU numbering (459 by Kabat numbering), N434S by EU numbering (465 by Kabat numbering), and a combination thereof. In some embodiments, the heavy chain constant region nucleotide sequence encodes a human heavy chain constant region amino acid sequence comprising a mutation selected from the group consisting of M428L by EU numbering (M459L by Kabat numbering), V259I by EU numbering (V272I by Kabat numbering), V308F by EU numbering (V327 by Kabat numbering), and a combination thereof. In some embodiments, the heavy chain constant region nucleotide sequence encodes a human heavy chain constant region amino acid sequence comprising an N434A mutation by EU numbering (an N465A mutation by Kabat numbering). In some embodiments, the heavy chain constant region nucleotide sequence encodes a human heavy chain constant region amino acid sequence comprising a mutation selected from the group consisting of M252Y by EU numbering (M265Y by Kabat numbering), S254T by EU numbering (S267T by Kabat numbering), T256E by EU numbering (T269E by Kabat numbering), and a combination thereof. In some embodiments, the heavy chain constant region nucleotide sequence encodes a human heavy chain constant region amino acid sequence comprising a mutation selected from the group consisting of T250Q by EU numbering (T263Q by Kabat numbering), M428L by EU numbering (M459L by Kabat numbering), and a combination thereof. In some embodiments, the heavy chain constant region nucleotide sequence encodes a human heavy chain constant region amino acid sequence comprising a mutation selected from the group consisting of H433K by EU numbering (H464K by Kabat numbering), N434Y by EU numbering (N465Y by Kabat numbering), and a combination thereof.

In various embodiments, Fc domains are modified (in the non-human animal; or, in an expression system that expresses together in a single polypeptide a light chain variable domain derived from a heavy chain of a non-human animal as described herein and a heavy chain constant sequence (e.g., a human sequence)) to have altered Fc receptor binding, which in turn affects effector function. In some embodiments, an engineered heavy chain constant region (C_(H)), which includes the Fc domain, is chimeric. As such, a chimeric C_(H) region combines C_(H) domains derived from more than one immunoglobulin isotype. For example, a chimeric C_(H) region comprises part or all of a C_(H)2 domain derived from a human IgG1, human IgG2 or human IgG4 molecule, combined with part or all of a C₁₁3 domain derived from a human IgG1, human IgG2 or human IgG4 molecule. In some embodiments, a chimeric C_(H) region contains a chimeric hinge region. For example, a chimeric hinge may comprise an “upper hinge” amino acid sequence (amino acid residues from positions 216 to 227 according to EU numbering; amino acid residues from positions 226 to 240 according to Kabat numbering) derived from a human IgG1, a human IgG2 or a human IgG4 hinge region, combined with a “lower hinge” sequence (amino acid residues from positions 228 to 236 according to EU numbering; amino acid positions from positions 241 to 249 according to Kabat numbering) derived from a human IgG1, a human IgG2 or a human IgG4 hinge region. In some embodiments, the chimeric hinge region comprises amino acid residues derived from a human IgG1 or a human IgG4 upper hinge and amino acid residues derived from a human IgG2 lower hinge.

In some embodiments, the Fc domain may be engineered to activate all, some, or none of the normal Fc effector functions, without affecting the Fc-containing protein's (e.g. antibody's) desired pharmacokinetic properties. For examples of proteins comprising chimeric C_(H) regions and having altered effector functions, see International Patent Application No. PCT/US2014/14175, filed Jan. 31, 2014, which is herein incorporated in its entirety.

In various aspects, the genome of the non-human animals is modified (i) to delete or render nonfunctional (e.g., via insertion of a nucleotide sequence (e.g., an exogenous nucleotide sequence)) in the immunoglobulin locus or via non-functional rearrangement or inversion of all, or substantially all, endogenous functional immunoglobulin V_(H), D, J_(H) gene segments; and (ii) to comprise unrearranged human immunoglobulin light chain variable region gene segments, wherein the gene segments are present at an endogenous locus (i.e., where the gene segments are located in a wild type non-human animal). In some embodiments, the unrearranged human immunoglobulin light chain variable region gene segments are integrated in the genome (e.g., at a locus different from the endogenous immunoglobulin heavy chain locus in its genome, or within its endogenous locus, e.g., within an immunoglobulin variable locus, wherein the endogenous locus is placed or moved to a different location in the genome). In some embodiments, e.g., about 80% or more, about 85% or more, about 90% or more, about 95% or more, about 96% or more, about 97% or more, about 98% or more, or about 99% or more of all endogenous functional heavy chain V, D, or J gene segments are deleted or rendered non-functional. In some embodiments, e.g., at least 95%, 96%, 97%, 98%, or 99% of endogenous functional heavy chain V, D, or J gene segments are deleted or rendered non-functional. In some embodiments, the unrearranged human immunoglobulin light chain variable region gene segments are operably linked to a human or non-human heavy chain constant region gene sequence.

In some embodiments, the genetically modified non-human animal comprises a modification that deletes or renders non-functional endogenous functional V_(H), D, and J_(H) heavy chain variable gene segments and endogenous functional light chain variable V_(L) and J_(L) gene segments; and comprises (i) a rearranged human immunoglobulin light chain variable region nucleotide sequence (or a limited number of human V_(L) gene segments, e.g., two or more but less than the wild type number of human V_(L) gene segments) and (ii) a nucleotide sequence encoding unrearranged human immunoglobulin light chain V gene segments (V_(L)) and unrearranged human immunoglobulin light chain J gene segments (J_(L)) at an endogenous immunoglobulin locus (e.g., an endogenous immunoglobulin heavy chain locus comprising one or more heavy chain constant region genes each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising at least an intact Igμ gene and at least one of an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and an intact Igα gene) or integrated elsewhere in the genome (e.g., at a locus different from the endogenous immunoglobulin locus in its genome, or within its endogenous locus, e.g., within an immunoglobulin variable region locus, wherein the endogenous locus is placed or moved to a different location in the genome). In some embodiments, the genetically modified non-human animal comprises a modification that deletes or renders non-functional endogenous V_(H), D, and J_(H) heavy chain variable gene segments and endogenous light chain variable V_(L) and J_(L) gene segments; and comprises (i) a rearranged human immunoglobulin light chain variable region nucleotide sequence (or a limited number of human V_(L) gene segments, e.g., two or more but less than the wild type number of human V_(L) gene segments) and (ii) one or more unrearranged human immunoglobulin light chain variable region gene segments (V_(L) and J_(L)) at an endogenous location (e.g., an endogenous immunoglobulin heavy chain locus) or integrated elsewhere in the genome (e.g., at a locus different from the endogenous immunoglobulin chain locus in its genome, or within its endogenous locus, e.g., within an immunoglobulin variable region locus, wherein the endogenous locus is placed or moved to a different location in the genome). In some embodiments, e.g., about 80% or more, about 85% or more, about 90% or more, about 95% or more, about 96% or more, about 97% or more, about 98% or more, or about 99% or more of all endogenous functional heavy chain V, D, or J gene segments are deleted or rendered non-functional. In some embodiments, e.g., at least 95%, 96%, 97%, 98%, or 99% of endogenous functional heavy chain V, D, or J gene segments are deleted or rendered non-functional. In some embodiments, the unrearranged human immunoglobulin light chain variable region gene segments are operably linked to a human or non-human heavy chain constant region gene sequence comprising one or more heavy chain constant region genes each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising at least an intact Igμ gene and at least one of an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and an intact Igα gene. In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence (or a limited number of human V_(L) gene segments, e.g., two or more but less than the wild type number of human V_(L) gene segments) is operably linked to a human or non-human light chain constant region gene sequence, either kappa or lambda.

Various embodiments encompass light chain variable domains derived from immunoglobulin hybrid chains encoded by a hybrid immunoglobulin locus. Nucleic acid sequences encoding light chain variable domains may be used in making the genetically modified non-humans described herein, may be expressed by such animals, and/or may encode amino acids present in antibodies produced by (or derived from sequences diversified by) such animals. In some embodiments, the light chain variable domain is a human Vκ domain. In some embodiments, the light chain variable domain is a mouse Vκ domain. In some embodiments, the light chain variable domain is a rat Vκ domain. In some embodiments, the light chain variable domain is a human Vλ domain. In some embodiments, the light chain variable domain is a mouse Vλ domain. In some embodiments, the light chain variable domain is a rat Vλ domain.

In various embodiments, the light chain variable domains produced by the genetically modified non-human animals described herein are encoded by one or more mouse or human immunoglobulin κ light chain variable gene segments. In some embodiments, the one or more mouse immunoglobulin κ light chain variable gene segments comprise about three megabases of the mouse immunoglobulin κ light chain locus. In some embodiments, the one or more mouse immunoglobulin κ light chain variable gene segments comprises at least 137 Vκ gene segments, at least five Jκ gene segments or a combination thereof of the mouse immunoglobulin κ light chain locus. In some embodiments, the one or more human immunoglobulin κ light chain variable gene segments comprises about one-half megabase of a human immunoglobulin κ light chain locus. In specific embodiments, the one or more human immunoglobulin κ light chain variable gene segments comprise the proximal repeat (with respect to the immunoglobulin κ constant region) of a human immunoglobulin κ light chain locus. In some embodiments, the one or more human immunoglobulin κ light chain variable gene segments comprises at least 40 Vκ gene segments, at least five Jκ gene segments or a combination thereof of a human immunoglobulin κ light chain locus.

In particular embodiments, the genetically modified non-human animals further comprise a nucleotide sequence encoding an unrearranged human immunoglobulin light chain (V_(L)) gene segment and an unrearranged human immunoglobulin light chain (J_(L)) gene segment. In some embodiments, the nucleotide sequence encoding the unrearranged light chain V gene segment and the unrearranged light chain J gene segment is operably linked to an immunoglobulin heavy chain constant region gene sequence. In some embodiments, the unrearranged human immunoglobulin light chain V (V_(L)) gene segment and the unrearranged human immunoglobulin J (J_(L)) gene segment are operably linked, at an endogenous rodent locus, to a rodent immunoglobulin heavy chain constant region gene; e.g., an IgM or IgG heavy chain constant region gene, each of which encode a functional C_(H)1 domain.

In various embodiments, the unrearranged human variable region gene segments (e.g., human Vκ gene segments) are capable of rearranging and encoding human variable domains of an antibody. In some embodiments, the non-human animal does not comprise an endogenous V_(L) gene segment. In some embodiments, the human Vκ gene segments expressed by the non-human animals are selected from the group consisting of Vκ1-5, Vκ1-6, Vκ1-8, Vκ1-9, Vκ1-12, Vκ1-13, Vκ1-16, Vκ1-17, Vκ1-22, Vκ1-27, Vκ1-32, Vκ1-33, Vκ1-35, Vκ1-37, Vκ1-39, Vκ1D-8, Vκ1D-12, Vκ1D-13, Vκ1D-16, Vκ1D-17, Vκ1D-22, Vκ1D-27, Vκ1D-32, Vκ1D-33, Vκ1D-35, Vκ1D-37, Vκ1D-39, Vκ1D-42, Vκ1D-43, Vκ1-NL1, Vκ2-4, Vκ2-10, Vκ2-14, Vκ2-18, Vκ2-19, Vκ2-23, Vκ2-24, Vκ2-26, Vκ2-28, Vκ2-29, Vκ2-30, Vκ2-36, Vκ2-38, Vκ2-40, Vκ2D-10, Vκ2D-14, Vκ2D-18, Vκ2D-19, Vκ2D-23, Vκ2D-24, Vκ2D-26, Vκ2D-28, Vκ2D-29, Vκ2D-30, Vκ2D-36, Vκ2D-38, Vκ2D-40, Vκ3-7, Vκ3-11, Vκ3-20, Vκ3-25, Vκ3-31, Vκ3-34, Vκ3D-7, Vκ3D-7, VOD-11, Vκ3D-15, Vκ3D-15, Vκ3D-20, Vκ3D-25, Vκ3D-31, Vκ3D-34, Vκ3-NL1, Vκ3-NL2, Vκ3-NL3, Vκ3-NL4, Vκ3-NL5, Vκ4-1, Vκ5-2, Vκ6-21, Vκ6D-21, Vκ6D-41, and Vκ7-3. In some embodiments, the genetically modified non-human animals described herein express all functional human Vκ genes. In some embodiments, the human Vκ gene segments comprise Vκ4-1, Vκ5-2, Vκ7-3, Vλ2-4, Vκ1-5, and Vκ1-6. In some embodiments, the Vκ gene segments comprise Vκ3-7, Vκ1-8, Vκ1-9, Vκ2-10, Vκ3-11, Vκ1-12, Vκ1-13, Vκ2-14, Vκ3-15 and Vκ1-16. In some embodiments, the human Vκ gene segments comprise Vκ1-17, Vκ2-18, Vκ2-19, Vκ3-20, Vκ6-21, Vκ1-22, Vκ1-23, Vκ2-24, Vκ3-25, Vκ2-26, Vκ1-27, Vκ2-28, Vκ2-29, and Vκ2-30. In some embodiments, the human Vκ gene segments comprise Vκ3-31, Vκ1-32, Vκ1-33, Vκ3-34, Vκ1-35, Vκ2-36, Vκ1-37, Vκ2-38, Vκ1-39, and Vκ2-40. In various embodiments, the non-human animal comprises five human Jκ gene segments, e.g., Jκ1, Jκ2, Jκ3, Jκ4, and Jκ5 gene segments. In specific embodiments, the Vκ gene segments comprise contiguous human immunoglobulin κ gene segments spanning the human immunoglobulin κ light chain locus from Vκ4-1 through Vκ2-40, and the Jκ gene segments comprise contiguous gene segments spanning the human immunoglobulin κ light chain locus from Jκ1 through Jκ5. In some embodiments, the immunoglobulin light chain locus of the non-human animal comprise two human V_(L) gene segments, Vκ1-39 and Vκ3-20. In some embodiments, one or more (e.g., 2, 3, 4, 5, or more) human V_(L) gene segments and two or more human J_(L) gene segments are present at an endogenous heavy chain locus. In some embodiments, the genetically modified non-human animal is a mouse that comprises a functional λ light chain locus. In other embodiments, the mouse comprises a non-functional λ light chain locus.

In some embodiments, a genetically modified non-human animal (e.g., mouse or rat) as described herein expresses a rearranged human immunoglobulin light chain variable region nucleotide sequence (i.e., produces an antigen-binding protein comprising a rearranged light chain variable domain) and one or more, two or more, three or more, four or more, five or more, etc. light chain variable domains encoded by Vκ genes selected from the group consisting of Vκ1-5, Vκ1-6, Vκ1-8, Vκ1-9, Vκ1-12, Vκ1-13, Vκ1-16, Vκ1-17, Vκ1-22, Vκ1-27, Vκ1-32, Vκ1-33, Vκ1-35, Vκ1-37, Vκ1-39, Vκ1D-8, Vκ1D-12, Vκ1D-13, Vκ1D-16, Vκ1D-17, Vκ1D-22, Vκ1D-27, Vκ1D-32, Vκ1D-33, Vκ1D-35, Vκ1D-37, Vκ1D-39, Vκ1D-42, Vκ1D-43, Vκ1-NL1, Vκ2-4, Vκ2-10, Vκ2-14, Vκ2-18, Vκ2-19, Vκ2-23, Vκ2-24, Vκ2-26, Vκ2-28, Vκ2-29, Vκ2-30, Vκ2-36, Vκ2-38, Vκ2-40, Vκ2D-10, Vκ2D-14, Vκ2D-18, Vκ2D-19, Vκ2D-23, Vκ2D-24, Vκ2D-26, Vκ2D-28, Vκ2D-29, Vκ2D-30, Vκ2D-36, Vκ2D-38, Vκ2D-40, Vκ3-7, Vκ3-11, Vκ3-15, Vκ3-20, Vκ3-25, Vκ3-31, Vκ3-34, Vκ3D-7, Vκ3D-7, Vκ3D-11, Vκ3D-15, Vκ3D-15, Vκ3D-20, Vκ3D-25, Vκ3D-31, VOD-34, Vκ3-NL1, Vκ3-NL2, Vκ3-NL3, Vκ3-NL4, Vκ3-NL5, Vκ4-1, Vκ5-2, Vκ6-21, Vκ6D-21, Vκ6D-41, and Vκ7-3.

In various embodiments, the rearranged human light chain variable region nucleotide sequence encodes one or more histidine codons that are not encoded by a corresponding human germ line light chain variable gene segment. In some embodiments, the light chain variable domain as described herein exhibits a decrease in dissociative half-life (t_(1/2)) at an acidic pH as compared to neutral pH of at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 15-fold, at least about 20-fold, at least about 25-fold, or at least about 30-fold. In some embodiments, the decrease in t_(1/2) at an acidic pH as compared to a neutral pH is about 30 fold or more. In some embodiments, the rearranged human light chain variable region nucleotide sequence (or at least one of the limited number of human V_(L) gene segments) comprises a substitution of at least one non-histidine codon encoded by the corresponding human germ line V_(L) gene segment with a histidine codon. In some embodiments, the substitution is of one, two, three, or four codons (e.g., three or four codons). In some embodiments, the substitution is in the CDR3 codon(s). In some embodiments, the human V_(L) gene segments is a human Vκ1-39 or human Vκ3-20 gene segment, and the human Vκ1-39 or human Vκ3-20 gene segment comprises a substitution of at least one non-histidine codon encoded by a corresponding human germ line V_(L) gene segment with the histidine codon. In some embodiments, the human Vκ1-39 or human Vκ3-20 gene segment comprises a substitution of three or four histidine codons. In some embodiments, the three or four substitutions are in the CDR3 region. In some embodiments, the substitution is of three non-histidine codons of the human Vκ1-39 gene segment, wherein the substitution is designed to express histidines at positions 106, 108, and 111. In some embodiments, the substitution is of four non-histidine codons of the human Vκ1-39 gene segment, and the substitution is designed to express histidines at positions 105, 106, 108, and 111 (see, e.g., U.S. Patent Application Publication No. 2013-0247234 A1 and WO 2013/138680, incorporated by reference herein). In some embodiments, the substitution is of three non-histidine codons of the human Vκ3-20 gene segment, and the substitution is designed to express histidines at positions 105, 106, and 109. In yet additional embodiments, the substitution is of four non-histidine codons of the human Vκ3-20 gene segment, and the substitution is designed to express histidines at positions 105, 106, 107, and 109. In some embodiments, the immunoglobulin light chain locus comprises a rearranged human light chain variable region nucleotide sequence (or a limited number of human V_(L) gene segments, e.g., two or more but less than the wild type number of human V_(L) gene segments), wherein the nucleotide sequence (or at least one of the limited number of human V_(L) gene segments) comprises at least one histidine codon that is not encoded by the corresponding human germ line V_(L) gene segment. In various embodiments, the non-human animal comprising the genetically modified immunoglobulin loci as described herein, upon stimulation by an antigen of interest, expresses an antigen-binding protein comprising an amino acid sequence derived from human V_(L) gene segments, wherein the antigen-binding protein retains at least one histidine residue at an amino acid position encoded by the at least one histidine codon introduced into the rearranged human light chain variable region nucleotide sequence (or the at least one of the limited number of human V_(L) gene segments). In some embodiments, the animal expresses a population of antigen-binding proteins in response to an antigen, wherein all antigen-binding proteins in the population comprise (a) immunoglobulin light chain variable domains derived from a rearrangement of the human V_(L) gene segments and the J_(L) gene segments, and (b) immunoglobulin light chains comprising human light chain variable domains encoded by the rearranged human immunoglobulin light chain variable region nucleotide sequence (or encoded by one of the limited number of human V_(L) gene segments), wherein rearranged human immunoglobulin light chain variable region nucleotide sequence (or at least one of the limited number of human V_(L) gene segments) encodes one or more histidine codons that are not encoded by the corresponding human germ line V_(L) gene segment.

Various embodiments encompass light chain constant region sequences. In some embodiments, for example, a first nucleotide sequence that encodes a human light chain variable domain (i.e., where the first nucleotide sequence contains unrearranged human immunoglobulin light chain variable region gene segments) is operably linked to a heavy chain constant region gene sequence, and a second nucleotide sequence that encodes a human light chain variable domain (i.e., where the second nucleotide sequence is a rearranged human immunoglobulin light chain variable nucleotide sequence or where the second sequence includes a limited number of human V_(L) gene segments, e.g., two or more but less than the wild type number of human V_(L) gene segments) is operably linked to a light chain constant region gene sequence. In various embodiments, the light chain constant region sequence operably linked to the rearranged human immunoglobulin light chain variable region nucleotide sequence (or limited number of human V_(L) gene segments) is a human κ light chain constant region sequence. In some embodiments, the light chain constant region sequence operably linked to the rearranged light chain variable region nucleotide sequence (limited number of human V_(L) gene segments) is a mouse κ light chain constant region sequence. In some embodiments, the light chain constant region sequence operably linked to the rearranged light chain variable region nucleotide sequence (limited number of human V_(L) gene segments) is a rat κ light chain constant region sequence. In some embodiments, the light chain constant region sequence operably linked to the rearranged light chain variable region nucleotide sequence (limited number of human V_(L) gene segments) is a human flight chain constant region sequence. In some embodiments, the light chain constant region sequence operably linked to the rearranged light chain variable region nucleotide sequence (limited number of human V_(L) gene segments) is a mouse λ light chain constant region sequence. In some embodiments, the light chain constant region sequence operably linked to the rearranged light chain variable region nucleotide sequence (limited number of human V_(L) gene segments) is a rat 2 light chain constant region sequence.

In various aspects, non-human animals are provided comprising a genetically modified immunoglobulin locus that encodes a rearranged light chain variable domain (e.g., where an immunoglobulin locus comprises a rearranged human immunoglobulin light chain variable region nucleotide sequence or a restricted (limited) number of human V_(L) gene segments), wherein the rearranged light chain variable domain comprises a light chain variable (V_(L)) sequence that is operably linked to a light chain J segment (J_(L)) sequence. In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence (or limited number of human V_(L) gene segments) is operably linked to a non-human light chain constant region gene sequence. In some embodiments, the non-human light chain constant region gene sequence is a mouse or a rat constant region gene sequence. In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence (or limited number of human V_(L) gene segments) is operably linked to a human light chain constant region gene sequence.

In another aspect, genetically modified non-human animals and methods for making said animals are provided in which the animals comprise a functional universal light chain (“ULC”) immunoglobulin locus (see, e.g., 2011-0195454 A1, US 2012-0021409A1, US 2012-0192300A1, US 2013-0045492A1, US 2013-0185821A1 and US 2013-0302836A1, incorporated by reference herein in their entireties) or a functional dual light chain (“DLC”) immunoglobulin locus (see, e.g., U.S. Patent Application Publication No. US-2013-0198880-A1, incorporated by reference herein in its entirety). In some embodiments, such animals further comprise unrearranged light chain variable region gene segments operably linked to a human or non-human heavy chain constant region gene sequence (i.e., human V_(L) and J_(L) gene segments operably linked to an IgM, IgG, etc.). A ULC or DLC as used in the embodiments described herein can also be used to generate antibody variable chain sequences whose diversity results primarily from the processes of somatic mutation, thereby elucidating antibody variable chain sequences whose antigen-binding capacity benefits from post-genomic events.

Methods of Making and Using Non-Human Animals Comprising a High Diversity Hybrid Chain Locus Containing Unrearranged Light Chain Variable Region Gene Segments and a Low Diversity Light Chain Locus Containing a Rearranged Light Chain Variable Region Sequence

Methods of making and using the genetically modified non-human animals described herein are provided. Methods are provided for placing a rearranged human light chain variable region nucleic acid sequence (or a limited number of human V_(L) gene segments, e.g., two or more but less than the wild type number of human V_(L) gene segments) in operable linkage with an immunoglobulin light chain constant region nucleic acid sequence in the genome of a non-human animal. In various embodiments, the constant region nucleic acid sequence is human or non-human, and the non-human animal is a rodent. In various embodiments, the methods comprise making a non-human animal that further comprises a hybrid immunoglobulin chain locus, e.g., an immunoglobulin locus comprising one or more human light chain variable region gene segments, e.g., 40 human Vκ gene segments and five human Jκ gene segments, operably linked to a human or non-human heavy chain constant region nucleic acid sequence. In various aspects, the methods comprise placing the aforementioned sequences in the germ line of a non-human animal, e.g., a rodent, employing, e.g., transgenic technology including, e.g., employing modified pluripotent or totipotent donor cells (e.g., ES cells or iPS cells) with host embryos, germ cells (e.g., oocytes), etc. Thus, embodiments include a non-human hybrid immunoglobulin chain locus., e.g., an immunoglobulin chain locus in a genome of a non-human germ cell comprising unrearranged human immunoglobulin light chain variable region gene segments operably linked to a heavy chain constant region gene sequence, wherein the constant region gene sequence comprises a non-human sequence, a human sequence, or a combination thereof. In some embodiments, the rearranged human immunoglobulin light chain variable region nucleotide sequence (or a limited number of human V_(L) gene segments) is operably linked to an endogenous non-human immunoglobulin constant region gene sequence. In some embodiments, the endogenous non-human immunoglobulin constant region gene sequence is a mouse or a rat light chain constant region gene sequence.

In various aspects, a method of making a non-human animal that comprises a genetically modified immunoglobulin locus is provided, wherein the method comprises: (a) modifying a genome of a non-human animal to delete or render non-functional endogenous functional immunoglobulin heavy chain V, D, and J gene segments; and (b) placing in the genome unrearranged human immunoglobulin light chain variable region gene segments. In one such aspect, a method is provided for making a non-human animal that expresses a single immunoglobulin light chain from a rearranged light chain gene sequence in the germ line of the non-human animal (or expressing an immunoglobulin light chain from a limited number of human V_(L) gene segments, e.g., two or more but less than the wild type number of human V_(L) gene segments), the method comprising a step of genetically modifying a non-human animal such that its entire antibody-expressing mature B cell population expresses a light chain derived from (i) a single V_(L) gene segment, and (ii) a single J_(L) gene segment or from (iii) a limited number of human V_(L) gene segments (e.g., two or more but less than the wild type number of human V_(L) gene segments). In some aspects, the method comprises inactivating or replacing an endogenous light chain immunoglobulin variable locus with a single rearranged light chain gene (or limited number of human V_(L) gene segments) as described herein.

In another aspect, methods of making a non-human animal that comprises a genetically modified immunoglobulin heavy chain locus are provided, such methods comprising: (a) modifying a genome of a non-human animal to delete or render non-functional endogenous functional immunoglobulin heavy chain V, D, and J gene segments; and (b) placing in the genome unrearranged human immunoglobulin light chain variable region gene segments. In some embodiments, substantially all endogenous functional V_(H), D, and J_(H) gene segments are deleted from the immunoglobulin heavy chain locus of the non-human animal or rendered non-functional (e.g., via insertion of a nucleotide sequence (e.g., an exogenous nucleotide sequence in the immunoglobulin locus) or via non-functional rearrangement, or inversion of, endogenous V_(H), D, J_(H) segments). In some embodiments, the method comprises inserting unrearranged human immunoglobulin light chain variable region gene segments into an endogenous location (e.g., an endogenous immunoglobulin heavy chain locus). In some embodiments, the unrearranged human immunoglobulin light chain variable region gene segments are present elsewhere in the genome (e.g., at a locus different from the endogenous immunoglobulin chain locus in its genome, or within its endogenous locus, e.g., within an immunoglobulin variable locus, wherein the endogenous locus is placed or moved to a different location in the genome). In some embodiments, e.g., about 80% or more, about 85% or more, about 90% or more, about 95% or more, about 96% or more, about 97% or more, about 98% or more, or about 99% or more of all endogenous functional V, D, or J gene segments are deleted or rendered non-functional. In some embodiments, e.g., at least 95%, 96%, 97%, 98%, or 99% of endogenous functional heavy chain V, D, or J gene segments are deleted or rendered non-functional.

In another aspect, methods are provided for making a non-human animal that comprises a genetically modified immunoglobulin locus, comprising: (a) modifying a genome of a non-human animal to delete or render non-functional endogenous functional immunoglobulin light chain V and J gene segments; and (b) placing in an endogenous immunoglobulin light chain locus a rearranged human or non-human immunoglobulin light chain variable region nucleotide sequence (i.e., a nucleotide sequence that encodes a rearranged light chain variable domain) or a limited number of human or non-human V_(L) gene segments (e.g., two or more but less than the wild type number of human V_(L) gene segments), wherein the nucleotide sequence (or limited number of human or non-human V_(L) gene segments) is operably linked to a light chain constant region gene sequence. In some embodiments, the genetically engineered immunoglobulin locus is present in the germ line genome of the non-human animal. In some embodiments, the rearranged human or non-human immunoglobulin light chain variable region nucleotide sequence (or limited number of human or non-human V_(L) gene segments) is operably linked to a κ light chain constant region gene sequence. In some embodiments, the rearranged human or non-human immunoglobulin light chain variable region nucleotide sequence (or limited number of human or non-human V_(L) gene segments) is operably linked to a mouse or rat κ light chain constant region gene sequence. In some embodiments, the rearranged human or non-human immunoglobulin light chain variable region nucleotide sequence (or limited number of human or non-human V_(L) gene segments) is operably linked to a human κ light chain constant region gene sequence. In some embodiments, the rearranged human or non-human immunoglobulin light chain variable region nucleotide sequence (or limited number of human or non-human V_(L) gene segments) is operably linked to a λ light chain constant region gene sequence. In some embodiments, rearranged human or non-human immunoglobulin light chain variable region nucleotide sequence (or limited number of human or non-human V_(L) gene segments) is operably linked to a mouse or rat λ light chain constant region gene sequence. In some embodiments, the rearranged human or non-human immunoglobulin light chain variable region nucleotide sequence (or limited number of human or non-human V_(L) gene segments) is operably linked to a human λ light chain constant region gene sequence.

In some embodiments, the limited number of human or non-human V_(L) gene segments are operably linked to one or more human or non-human J_(L) gene segments.

In another aspect, methods are provided for making a non-human animal that comprises a genetically modified immunoglobulin locus, comprising: (a) modifying a genome of a non-human animal to delete or render non-functional: (i) endogenous functional immunoglobulin heavy chain V, D, and J gene segments, and (ii) endogenous functional immunoglobulin light chain V and J gene segments; and (b) placing in the genome: (i) a first nucleotide sequence that encodes a rearranged light chain variable domain (e.g., where the first nucleotide sequence is a rearranged human immunoglobulin light chain variable region nucleotide sequence or where the first nucleotide sequence contains a limited number of human V_(L) gene segments, e.g., two or more but less than the wild type number of human V_(L) gene segments), wherein the first nucleotide sequence is operably linked to a light chain constant region gene sequence, and (ii) a second nucleotide sequence that encodes a human immunoglobulin light chain variable domain (i.e., where the second nucleotide sequence is an unrearranged human immunoglobulin light chain variable region nucleotide sequence), wherein the second nucleotide sequence is operably linked to a heavy chain constant region gene sequence comprising one or more heavy chain constant region genes each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising at least an intact Igμ gene and at least one of an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and an intact Igα gene. In some embodiments, the genetically engineered immunoglobulin locus is present in the germ line genome of the non-human animal. In some embodiments, the first nucleotide sequence that encodes the rearranged light chain variable domain (or contains a limited number of human V_(L) gene segments) is operably linked to a κ light chain constant region gene sequence. In some embodiments, the first nucleotide sequence that encodes the rearranged light chain variable domain (or contains a limited number of human V_(L) gene segments) is operably linked to a mouse or rat κ light chain constant region gene sequence. In some embodiments, the first nucleotide sequence that encodes the rearranged light chain variable domain (or contains a limited number of human V_(L) gene segments) is operably linked to a human κ light chain constant region gene sequence. In some embodiments, the first nucleotide sequence that encodes the rearranged light chain variable domain (or contains a limited number of human V_(L) gene segments) is operably linked to a λ light chain constant region gene sequence. In some embodiments, the first nucleotide sequence that encodes the rearranged light chain variable domain (or contains a limited number of human V_(L) gene segments) is operably linked to a mouse or rat λ light chain constant region gene sequence. In some embodiments, the first nucleotide sequence that encodes the rearranged light chain variable domain (or contains a limited number of human V_(L) gene segments) is operably linked to a human 2 light chain constant region gene sequence. In some embodiments, the human immunoglobulin light chain variable domain is a κ light chain variable domain. Thus, in some embodiments, the second nucleotide sequence is a human kappa light chain variable region nucleotide sequence. In some embodiments, the human immunoglobulin light chain variable domain is a 2 light chain variable domain. Thus, in some embodiments, the second nucleotide sequence is a human lambda light chain variable region nucleotide sequence. In some embodiments, the heavy chain constant region gene sequence is a non-human immunoglobulin heavy chain constant region gene sequence. In some embodiments, the non-human immunoglobulin heavy chain constant region gene sequence is a mouse or a rat heavy chain constant region gene sequence. In some embodiments, the non-human immunoglobulin heavy chain constant region gene sequence comprises an intact Igμ gene, an intact Igδ gene, an intact Igγ gene, an intact Igα gene, and/or an intact Igε gene.

Methods are provided for making a non-human animal, comprising: (a) modifying a genome of a non-human animal to delete or render non-functional (i) endogenous immunoglobulin heavy chain V_(H), D, and and/or J_(H) gene segments, and (ii) endogenous immunoglobulin light chain V and J gene segments; and (b) placing (i) a rearranged light chain variable region nucleotide sequence (or a limited number of human V_(L) gene segments, e.g., two or more but less than the wild type number of human V_(L) gene segments) at a light chain locus, wherein the rearranged light chain variable region nucleotide sequence (or limited number of human V_(L) gene segments) comprises a light chain V gene segment (V_(L)) sequence that is operably linked to a light chain J gene segment (J_(L)) sequence; and (ii) one or more unrearranged human immunoglobulin light chain variable region gene segments (e.g., 40 human Vκ gene segments and at least one human Jκ gene segments) at a heavy chain locus so that the gene segments are operably linked to a human or non-human heavy chain constant region nucleotide sequence comprising one or more heavy chain constant region genes, each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising an intact Igγ, gene, an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and/or an intact Igα gene. In some embodiments, the rearranged light chain variable region nucleotide sequence encodes one or more histidine codons that are not encoded by a corresponding human germ line light chain variable gene segment.

In some aspects, a method for making a non-human animal comprising a genetically modified immunoglobulin locus is provided, comprising: (a) modifying a genome of a non-human animal to delete or render non-functional endogenous immunoglobulin light chain V and J gene segments; and (b) placing in the genome of the non-human animal a rearranged human or non-human light chain variable region nucleotide sequence (or a limited number of human or non-human V_(L) gene segments) in operable linkage to a light chain constant region nucleotide sequence.

In various embodiments, the non-human animal is a rodent, e.g., a mouse, a rat, or a hamster. In some embodiments, the rodent is a mouse. In some embodiments, the light chain constant region is a rat or a mouse constant region, e.g., a rat or a mouse Cκ constant region.

In another aspect, a method for making a non-human animal comprising a genetically modified immunoglobulin locus is provided, comprising: (a) modifying a genome of a non-human animal to delete or render non-functional: (i) endogenous immunoglobulin heavy chain V, D, and/or J gene segments, and (ii) endogenous immunoglobulin light chain V and J gene segments; and (b) placing in the genome of the non-human animal: (i) a first nucleotide sequence that encodes a rearranged light chain variable domain (e.g., where the first nucleotide sequence is a rearranged human or non-human immunoglobulin light chain variable region nucleotide sequence or where the first nucleotide sequence contains a limited number of human V_(L) gene segments; e.g., two or more but less than the wild type number of human V_(L) gene segments), wherein the first nucleotide sequence is operably linked to a light chain constant region gene sequence, and (ii) a second nucleotide sequence that encodes a human or non-human light chain variable domain (i.e., where the second nucleotide sequence is an unrearranged human immunoglobulin light chain variable region nucleotide sequence), wherein the second nucleotide sequence is operably linked to a heavy chain constant region gene sequence. In some embodiments, the heavy chain constant region gene sequence comprises an intact Igμ gene, an intact Igδ gene, an intact Igγ gene, an intact Igα gene, and/or an intact Igε gene.

In various embodiments, the non-human animal is a rodent, e.g., a mouse, a rat, or a hamster. In some embodiments, the rodent is a mouse. In some embodiments, the light chain constant region is a rat or a mouse constant region, e.g., a rat or a mouse Cκ constant region. In some embodiments, the second nucleotide sequence is operably linked to a mouse or rat heavy chain constant region gene sequence comprising a nucleotide sequence encoding a C_(H)1, a hinge, a C_(H)2, a C_(H)3, or a combination thereof. In some embodiments, the second nucleotide sequence is operably linked to a human heavy chain constant region gene sequence comprising a nucleotide sequence encoding a C_(H)1, a hinge, a C_(H)2, a C_(H)3, or a combination thereof.

In another aspect, a method is provided for making a non-human animal that comprises a genetically modified immunoglobulin locus, comprising: (a) modifying a genome of a non-human animal to delete or render non-functional: (i) endogenous immunoglobulin heavy chain V, D, and/or J gene segments, and (ii) endogenous immunoglobulin light chain V and J gene segments; and (b) placing in the genome of the non-human animal: (i) a first allele comprising a first nucleotide sequence that encodes a rearranged light chain variable domain (e.g., where the first nucleotide sequence is a rearranged human immunoglobulin light chain variable region nucleotide sequence or where the first nucleotide sequence contains a limited number of human V_(L) gene segments; e.g., two or more but less than the wild type number of human V_(L) gene segments) operably linked to a light chain constant region gene sequence, and (ii) a second allele comprising a second nucleotide sequence that encodes a light chain variable domain (i.e., where the second nucleotide sequence is an unrearranged human immunoglobulin light chain variable region nucleotide sequence) operably linked to a heavy chain constant region gene sequence.

In another aspect, a method of making a non-human animal that comprises a genetically modified immunoglobulin heavy chain locus (e.g., hybrid immunoglobulin chain locus) and a modified immunoglobulin light chain locus is provided comprising: (a) modifying a genome of a non-human animal to delete or render non-functional endogenous immunoglobulin heavy chain V, D, and and/or J gene segments; (11) placing in an endogenous heavy chain locus of the non-human animal unrearranged human immunoglobulin light chain variable region gene segments in operable linkage with a heavy chain constant region, wherein the unrearranged human immunoglobulin light chain variable region gene segments comprise human Vκ and human Jκ gene segments; (c) modifying a genome of a non-human animal to delete or render non-functional endogenous immunoglobulin light chain V and and/or J gene segments; and (d) placing in an endogenous light chain locus of the non-human animal a rearranged human immunoglobulin light chain variable region nucleotide sequence (or a limited number of human V_(L) gene segments, e.g., two or more but less than the wild type number of human V_(L) gene segments) in operable linkage with a light chain constant region, wherein the rearranged human immunoglobulin light chain variable region nucleotide sequence (or limited number of human V_(L) gene segments) comprises a rearranged human VκJκ sequence (or 2, 3, or 4 human V_(L) gene segments). In some embodiments, the rearranged human VκJκ sequence is a human Vκ1-39Jκ5 sequence (e.g., set forth in SEQ ID NO:1). In some embodiment, the rearranged human VκJκ sequence is a human Vκ3-20Jκ1 sequence (e.g., set forth in SEQ ID NO:2). In some embodiments, the limited number of human V_(L) gene segments includes a human Vκ1-39 gene segment and a human Vκ3-20 gene segment. In some embodiments, the heavy chain constant region gene sequence comprises an intact Igμ gene, an intact Igδ gene, an intact Igγ gene, an intact Igα gene, and/or an intact Igε gene.

In various embodiments, the non-human animal is a rodent, e.g., a mouse, a rat, or a hamster. In some embodiments, the rodent is a mouse. In some embodiments, the light chain constant region is a rat or a mouse constant region, e.g., a rat or a mouse Cκ constant region. In some embodiments, the unrearranged human light chain variable region gene segments are operably linked to a mouse or rat heavy chain constant region gene sequence comprising a nucleotide sequence encoding C_(H)1, a hinge, a C_(H)2, a C_(H)3, or a combination thereof. In some embodiments, the unrearranged light chain variable region gene segments are operably linked to a human heavy chain constant region gene sequence comprising a nucleotide sequence encoding a C_(H)1, a hinge, a C_(H)2, a C_(H)3, or a combination thereof. In some embodiments, the unrearranged light chain variable region gene segments are operably linked to a human heavy chain constant region gene sequence comprising a nucleotide sequence encoding each of a C_(H)1, a hinge, a C_(H)2, and a C_(H)3 domain.

In another aspect, a method for making a non-human animal comprising a genetically modified immunoglobulin locus is provided, comprising: (a) modifying a genome of a non-human animal to delete or render non-functional: (i) endogenous immunoglobulin heavy chain V, D, and/or J gene segments, and (ii) endogenous immunoglobulin light chain V and J gene segments; and (b) placing in the genome of the non-human animal: (i) a rearranged human immunoglobulin light chain variable region nucleotide sequence (or a limited number of human V_(L) gene segments, e.g., two or more but less than the wild type number of human V_(L) gene segments) in operable linkage to a light chain constant region nucleotide sequence; and (ii) one or more human immunoglobulin light chain variable V_(L) and J_(L) gene segments in operable linkage to a heavy chain constant region nucleic acid sequence comprising one or more heavy chain constant region genes, each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising an intact Igμ gene, an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and/or an intact Igα gene.

In various embodiments, the non-human animal is a rodent, e.g., a mouse, a rat, or a hamster. In some embodiments, the rodent is a mouse. In some embodiments, the light chain constant region is a rat or a mouse constant region, e.g., a rat or a mouse Cκ constant region.

In another aspect, nucleic acid sequences encoding rearranged light chain variable domains are provided. In some embodiments, the nucleic acid sequence is derived from a human Vκ and Jκ gene segments. In some embodiments, the nucleic acid sequence is derived from a human germ line Vκ segment and a human germ line Jκ segment. In some embodiments, the human Vκ segment corresponds to observed variants in the human population. In various embodiments, the nucleic acid sequence comprises a human Vκ gene selected from the group consisting of Vκ1-5, Vκ1-6, Vκ1-8, Vκ1-9, Vκ1-12, Vκ1-13, Vκ1-16, Vκ1-17, Vκ1-22, Vκ1-27, Vκ1-32, Vκ1-33, Vκ1-35, Vκ1-37, Vκ1-39, Vκ1D-8, Vκ1D- 12, Vκ1D-13, Vκ1D-16, Vκ1D-17, Vκ1D-22, Vκ1D-27, Vκ1D-32, Vκ1D-33, Vκ1D-35, Vκ1D-37, Vκ1D-39, Vκ1D-42, Vκ1D-43, Vκ1-NL1, Vκ2-4, Vκ2-10, Vκ2-14, Vκ2-18, Vκ2-19, Vκ2-23, Vκ2-24, Vκ2-26, Vκ2-28, Vκ2-29, Vκ2-30, Vκ2-36, Vκ2-38, Vκ2-40, Vκ2D-10, Vκ2D-14, Vκ2D-18, Vκ2D-19, Vκ2D-23, Vκ2D-24, Vκ2D-26, Vκ2D-28, Vκ2D-29, Vκ2D-30, Vκ2D-36, Vκ2D-38, Vκ2D-40, Vκ3-7, Vκ3-11, Vκ3-15, Vκ3-20, Vκ3-25, Vκ3-31, Vκ3-34, Vκ3D-7, Vκ3D-7, Vκ3D-11, Vκ3D-15, Vκ3D-15, Vκ3D-20, Vκ3D-25, Vκ3D-31, Vκ3D-34, Vκ3-NL1, Vκ3-NL2, Vκ3-NL3, Vκ3-NL4, Vκ3-NL5, Vκ4-1, Vκ5-2, Vκ6-21, Vκ6D-21, Vκ6D-41, and Vκ7-3, and a polymorphic variant thereof. In some embodiments, the nucleic acid sequence further comprises a human or non-human animal heavy chain constant region gene sequence selected from a nucleotide sequence encoding a C_(H)1, a hinge, a C_(H)2, a C_(H)3, and a combination thereof. In specific embodiments, the nucleic acid comprises a constant region gene sequence comprising a nucleotide sequence encoding a C_(H)1, a hinge, a C_(H)2, and a C_(H)3. In various embodiments, the nucleic acid sequence comprises a human Jκ gene segment is selected from the group consisting of Jκ1, Jκ2, Jκ4, Jκ5, and a polymorphic variant thereof.

In another aspect, a nucleic acid construct is provided comprising an unrearranged human immunoglobulin light chain variable region nucleotide sequence (e.g., a nucleotide sequence that contains unrearranged human V_(L) and J_(L) gene segments) as described herein. In some embodiments, the nucleic acid construct is designed in such a way that the unrearranged human immunoglobulin light chain variable region nucleotide sequence is operably linked to a human or non-human animal heavy chain constant region gene sequence comprising one or more heavy chain constant region genes, each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising an intact Igμgene, an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and/or an intact Igα gene. In some embodiments, the nucleic acid construct contains two, three, four, or more unrearranged human immunoglobulin light chain variable region gene segments operably linked to a heavy chain constant region gene sequence. In some embodiments, the heavy chain constant region gene sequence comprises an intact Igμ gene, an intact Igδ gene, an intact Igγ gene, an intact Igα gene, and/or an intact Igε gene. In some embodiments, the nucleic acid construct is a targeting vector. In some embodiments, the targeting vector comprises an Adam6a gene, an Adam6b gene, or both, in order to prevent fertility problems associated with the deletion of the Adam6a/6b genes (see, for example, U.S. Pat. No. 8,642,835, incorporated by reference in its entirety). In some embodiments, the Adam6a and the Adam6b genes are placed at 5′ upstream of the transcriptional unit of the unrearranged human light chain gene segments. In some embodiments, the targeting vector comprises a selection cassette flanked by recombination sites. In some embodiments, the targeting vector comprises one or more site-specific recombination sites (e.g., a loxP or a FRT site).

In another aspect, methods are provided for obtaining a light chain variable region (V_(L/CH×ULC)) amino acid sequence capable of binding an antigen independently from a heavy chain variable region amino acid sequence, comprising: (a) immunizing a genetically modified non-human animal as described herein (e.g., a genetically modified animal whose genome comprises unrearranged human light chain variable region gene segments in operable linkage with a heavy chain constant region gene and a rearranged human or non-human light chain variable region nucleotide sequence (or a limited number of human or non-human V_(L) gene segments) in operable linkage with a light chain constant region gene) with an antigen of interest, wherein the non-human animal mounts an immune response to the antigen; and (b) obtaining a rearranged light chain (VJ) nucleic acid sequence of a light chain variable domain that specifically binds the antigen from a cell (e.g., a B cell) of the genetically modified non-human animal. In various embodiments, the light chain variable regions produced by such methods are provided.

In some aspects, methods for obtaining a nucleic acid sequence that encodes an immunoglobulin light chain variable region (V_(L/CH×ULC)) domain, comprise: (a) optionally immunizing a non-human animal with an antigen of interest or an immunogen thereof, wherein the non-human animal comprises in its genome (i) unrearranged human light chain variable region gene segments operably linked to a heavy chain constant region gene comprising one or more heavy chain constant region genes, each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising an intact Igμ gene, an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and/or an intact Igα gene, and (ii) a rearranged human or non-human immunoglobulin light chain variable region nucleotide sequence (or a limited number of human or non-human V_(L) gene segments) operably linked to a light chain constant region gene, (b) allowing the non-human animal to mount an immune response, (c) isolating from the immunized non-human animal a cell comprising a nucleic acid sequence that encodes a light chain variable domain that binds the antigen of interest, and (d) obtaining from the cell a nucleic acid sequence that encodes the light chain variable domain (V_(L/CH×ULC) domain) that binds the antigen of interest. In some embodiments, the heavy chain constant region gene sequence is a mouse or rat heavy chain constant region gene sequence. In some embodiments, the heavy chain constant region gene sequence is a human heavy chain constant region gene sequence. In some embodiments, the rearranged light chain variable domain expressed by the genetically modified locus is not autoreactive, i.e., non-immunogenic to the non-human animal. In some embodiments, the non-human animal comprises in its genome one or more (e.g., 6, 16, 30 or 40) unrearranged human V_(L) gene segments and one or more (e.g., 5) human J_(L) gene segments. In some certain embodiments, the unrearranged human V_(L) and J_(L) gene segments are Vκ and Jκ gene segments. In some embodiments, the isolating step (c) is carried out via fluorescence-activated cell sorting (FACS) or flow cytometry. In some embodiments, the cell comprising the nucleic acid sequence that encodes the light chain variable domain that binds the antigen of interest is a lymphocyte. In some embodiments, the lymphocyte comprises natural killer cells, T cells, or B cells. In some embodiments, the method further comprises a step of (c)′ fusing the lymphocyte with a cancer cell. In certain embodiments, the cancer cell is a myeloma cell.

Thus, in various aspects, methods are provided for obtaining a nucleic acid sequence that encodes an immunoglobulin light chain variable domain (V_(L/CH×ULC)) capable of binding an antigen independently from a heavy chain variable domain, comprising: (a) optionally immunizing a non-human animal with an antigen of interest or an immunogen thereof, wherein the non-human animal comprises in its genome (i) a rearranged human or non-human immunoglobulin light chain variable region nucleotide sequence (or a limited number of human or non-human V_(L) gene segments) operably linked to a light chain constant region nucleic acid sequence; and (ii) unrearranged human immunoglobulin light chain variable region gene segments (V_(L) and J_(L)) operably linked to a heavy chain constant region nucleotide sequence; (b) allowing the non-human animal to mount an immune response; (c) isolating from the immunized non-human animal a cell comprising a nucleic acid sequence that encodes a light chain variable domain that can bind the antigen; and (d) obtaining from the cell a nucleic acid sequence that encodes the light chain variable domain (V_(L/CH×ULC) domain) that can bind the antigen.

In some embodiments, the isolating step (c) is carried out via fluorescence-activated cell sorting (FACS) or flow cytometry. In some embodiments, the cell comprising the nucleic acid sequence that encodes the light chain variable domain that binds the antigen is a lymphocyte. In particular embodiments, the lymphocyte comprise natural killer cells, T cells, or B cells. In some embodiments, the methods further comprise a step of (c)′ fusing the lymphocyte with a cancer cell. In particular embodiments, the cancer cell is a myeloma cell. In some embodiments, the nucleic acid sequence of (d) is fused with a nucleic acid sequence encoding an immunoglobulin constant region nucleic acid sequence. In some embodiments, the light chain constant region nucleic acid sequence is a human kappa sequence or a human lambda sequence. In some embodiments, the light chain constant region nucleic acid sequence is a mouse kappa sequence or a mouse lambda sequence. In some embodiments, the light chain constant region nucleic acid sequence is a rat kappa sequence or a rat lambda sequence. In some embodiments, the heavy chain constant region nucleic acid sequence is a human sequence comprising one or more heavy chain constant region genes, each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising an intact Igμ gene, an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and/or an intact Igα gene. In some embodiments, the heavy chain constant region nucleic acid sequence is a mouse or rat sequence comprising one or more heavy chain constant region genes, each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising an intact Igμ, gene, an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and/or an intact Igα gene. In some embodiments, the nucleic acid sequence of (d) comprises one or more histidine codon substitutions or insertions that are derived from the unrearranged V_(L) gene segment in the genome of the animal.

In some aspects, methods are provided for obtaining a nucleic acid sequence that encodes an immunoglobulin light chain variable domain (V_(L/CH×ULC)), comprising: (a) optionally immunizing a non-human animal containing a genetically modified immunoglobulin loci as described herein with an antigen of interest, wherein the non-human animal comprises in its genome a rearranged human immunoglobulin light chain variable region nucleic acid sequence (or a limited number of human V_(L) gene segments) operably linked to a light chain constant region nucleic acid sequence and unrearranged human immunoglobulin light chain variable region gene segments operably linked to a heavy chain constant region nucleic acid sequence comprising one or more heavy chain constant region genes, each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising an intact Igμ gene, an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and/or an intact Igα gene; (b) allowing the non-human animal to mount an immune response; (c) harvesting a lymphocyte (e.g., a B cell) from the immunized non-human animal; (d) fusing the lymphocyte with a myeloma cell to form a hybridoma cell; and (e) obtaining from the hybridoma cell a nucleic acid sequence that encodes a light chain variable domain (V_(L/CH×ULC) domain) that can bind the antigen.

In another aspect, methods are provided for obtaining an immunoglobulin light chain variable region (V_(L/CH×ULC)) amino acid sequence, comprising: (a) optionally immunizing a non-human animal containing genetically modified immunoglobulin loci as described herein with an antigen of interest, wherein the non-human animal comprises in its genome (i) a first nucleotide sequence that encodes a rearranged light chain variable domain (i.e., where the first nucleotide sequence is a rearranged human immunoglobulin light chain variable region nucleotide sequence or where the first nucleotide sequence contains a limited number of human V_(L) gene segments, e.g., two or more but less than the wild type number of human V_(L) gene segments), wherein the first nucleotide sequence is operably linked to a light chain constant region gene sequence; and (ii) a second nucleotide sequence that encodes a human or non-human light chain variable domain (i.e., where the second nucleotide sequence is an unrearranged human immunoglobulin light chain variable nucleotide sequence), wherein the second nucleotide sequence is operably linked to a heavy chain constant region gene sequence comprising one or more heavy chain constant region genes, each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising an intact Igμ gene, an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and/or an intact Igα, gene; (b) allowing the non-human animal to mount an immune response; (c) harvesting a lymphocyte (e.g., a B cell) from the immunized non-human animal; (d) fusing the lymphocyte with a myeloma cell to form a hybridoma cell; and (e) obtaining from the hybridoma cell a nucleic acid sequence that encodes a light chain variable domain (V_(L) domain) that can bind the antigen.

In another aspect, methods are provided for obtaining an immunoglobulin light chain variable region (V_(L/CH×ULC)) nucleic acid sequence of an immunoglobulin hybrid chain, comprising: (a) optionally immunizing a non-human animal containing genetically modified immunoglobulin loci as described herein with an antigen of interest, wherein the non-human animal comprises in its genome (i) a rearranged human immunoglobulin light chain variable region nucleic acid sequence (or a limited number of human V_(L) gene segments, e.g., two or more but less than the number of wild type number of human V_(L) gene segments) operably linked to a light chain constant region nucleic acid sequence; and (ii) one or more (e.g., 6, 16, 30, 40 or more) unrearranged human immunoglobulin light chain variable region gene segments (V_(L) and J_(L)); (b) allowing the non-human animal to mount an immune response; (c) identifying a lymphocyte (e.g., a B cell) from the immunized non-human animal that expresses a V_(L/CH×ULC) amino acid sequence that binds the antigen independently from a heavy chain variable region; and, (d) cloning a nucleic acid sequence encoding the V_(L/CH×ULC) amino acid sequence of (c) from the lymphocyte of (c).

In additional aspects, a genetically modified immunoglobulin locus obtainable by any of the methods as described herein is provided. In various embodiments, the light chain variable regions produced by the methods as described herein and the nucleic acid sequence encoding such light chain variable regions are also provided.

In some aspects, an immunoglobulin heavy chain (e.g., hybrid immunoglobulin chain locus) and light chain locus in a germline genome of a non-human animal are provided, said light chain locus comprising (1) a rearranged human immunoglobulin light chain variable region nucleotide sequence (or a limited number of human V_(L) gene segments) that is operably linked to a light chain constant region gene sequence, and said heavy chain locus (e.g., hybrid immunoglobulin chain locus) comprising (2) an unrearranged human immunoglobulin light chain variable region nucleotide sequence that is operably linked to a heavy chain constant region gene sequence comprising one or more heavy chain constant region genes, each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising an intact Igμ gene, an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and/or an intact Igα gene. In some embodiments, the light chain constant region gene sequence is a κ light chain constant region gene sequence. In some embodiments, the light chain constant region gene sequence is a λ light chain constant region gene sequence. In some embodiments, the light chain constant region gene sequence is a mouse or rat light chain constant region gene sequence. In some embodiments, the rearranged light chain variable region nucleotide sequence is a κ light chain variable region gene sequence. In some embodiments, the rearranged light chain variable region nucleotide sequence is a 2 light chain variable region gene sequence. In some embodiments, the rearranged light chain variable region nucleotide sequence is a mouse or rat light chain variable region gene sequence. In some embodiments, the heavy chain constant region gene sequence comprises an intact Igμ gene, an intact Igδ gene, an intact Igγ gene, an intact Igα gene, and/or an intact Igε gene.

In various embodiments, a limited number of human or non-human V_(L) gene segments includes two human or non human V_(L) gene segments. In some embodiments, the two human or non-human V_(L) gene segments are operably linked to one or more, or five, human or non-human J_(L) gene segments. In some certain embodiments, a limited number of human or non-human V_(L) gene segments include two Vκ gene segments. In some certain embodiments, the two Vκ gene segments are operably linked to one or more, or five, Jκ gene segments.

Antigen-Binding Proteins

Additional aspects include antigen-binding proteins (e.g. antibodies) made by the genetically modified non-human animals described herein. Likewise, antigen-binding proteins (e.g., recombinant antibodies) with light chain variable region (V_(L/CH×ULC)) sequences derived from or produced by (i.e., expressed from the unrearranged human immunoglobulin light chain variable region gene segments) the genetically modified non-human animals described herein are also provided. In some embodiments, the antigen-binding proteins as described herein include an immunoglobulin light chain that can specifically bind an antigen of interest with an affinity (K_(D)) lower than 10⁻⁶, 10⁻⁷, 10⁻⁸, 10⁻⁹ or 10⁻¹⁹. In some embodiments, the immunoglobulin light chain produced by the methods are capable of specifically binding an antigen of interest in the absence of a heavy chain variable region with an affinity (K_(D)) lower than 10⁻⁶, 10⁻⁷, 10⁻⁸, 10⁻⁹, or 10⁻¹⁰.

In various embodiments, the light chain variable domains generated as described herein specifically bind a target molecule (“T”). In one embodiment, a target molecule is any protein, polypeptide, or other macromolecule whose activity or extracellular concentration is desired to be attenuated, reduced or eliminated. In many instances, the target molecule to which a light chain variable region binds is a protein or polypeptide (i.e., a “target protein”); however, also provided are embodiments wherein the target molecule (“T”) is a carbohydrate, glycoprotein, lipid, lipoprotein, lipopolysaccharide, or other non-protein polymer or molecule to which a light chain variable region binds. In various embodiments, T can be a cell surface-expressed target protein or a soluble target protein. Target binding by the antigen-binding molecule may take place in an extracellular or cell surface context. In certain embodiments, however, the antigen-binding molecule binds a target molecule inside the cell, for example within an intracellular component such as the endoplasmic reticulum, Golgi, endosome, lysosome, etc. Examples of cell surface-expressed target molecules include cell surface-expressed receptors, membrane-bound ligands, ion channels, and any other monomeric or multimeric polypeptide component with an extracellular portion that is attached to or associated with a cell membrane.

In another aspect, methods are provided for making an antigen-binding protein that comprises an immunoglobulin light chain variable V_(L/CH×ULC) domain that can bind an antigen independently from a heavy chain variable domain. Such methods comprise (a) optionally immunizing a genetically modified non-human animal with an antigen that comprises an epitope or immunogenic portion thereof, wherein the non-human animal comprises in its genome: (i) a rearranged human light chain variable region nucleic acid sequence (or a limited number of human V_(L) gene segments, e.g., two or more but less than the wild type number of human V_(L) gene segments) operably linked to a light chain constant region nucleic acid sequence; and (ii) unrearranged human immunoglobulin light chain variable region gene segments (V_(L) and J_(L)) operably linked to an immunoglobulin heavy chain constant region nucleic acid sequence; (b) allowing the non-human animal to mount an immune response to the epitope or immunogenic portion thereof; (c) isolating from the non-human animal a cell comprising a nucleic acid sequence that encodes a light chain variable domain that specifically binds the epitope or immunogenic portion thereof and/or (d) obtaining from the cell of (c) the nucleic acid sequence that encodes the light chain variable domain that specifically binds the epitope or immunogenic portion thereof; and (e) employing the nucleic acid sequence of (d) in an expression construct, fused to a human immunoglobulin constant region nucleic acid sequence. e.g., a human heavy chain constant region nucleic acid sequence comprising one or more heavy chain constant region genes, each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising an intact Igγ, gene, an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and/or an intact Igα gene.

In some embodiments, at least one of the unrearranged human light chain V_(L) or J_(L) gene segments encode one or more histidine codons that are not encoded by a corresponding human germline light chain variable gene segment. In some embodiments, rearranged human light chain variable region nucleic acid sequence (or at least one of the limited number of human V_(L) gene segments) encodes one or more histidine codons that are not encoded by a corresponding human germ line light chain variable gene segment. In some embodiments, the epitope is derived from a cell surface receptor.

In some embodiments, at least one of the human light chain V_(L) or J_(L) gene segments encode one or more histidine codons that are not encoded by a corresponding human germline light chain variable gene segment.

As will be clear throughout the specification, in some embodiments, provided protein variable domains are or comprise immunoglobulin-type variable domains (e.g., are or comprise immunoglobulin variable domains). In some embodiments, provided protein variable domains are or comprise heavy chain variable domains; in some embodiments, provided protein variable domains are or comprise light chain variable domains.

Those skilled in the art, reading the present specification, will readily appreciate that any of a variety of technologies can be utilized to produce, generate, and/or assemble antigen-binding proteins comprising light chain that can bind antigen independently from heavy chain variable domain. In some embodiments described herein, the antigen-binding proteins that are produced include antigen-binding proteins depicted in FIG. 19. These antigen-binding proteins comprise variable domains that are generated in non-human animals described herein, and nucleic acid sequences comprising sequences that encode these variable domains are co-expressed in a cell line to produce the antigen-binding proteins.

Genetically Modified Non-Human Cells and Embryos

In various aspects, a pluripotent cell, induced pluripotent, or totipotent stem cells derived from a non-human animal comprising the various genomic modifications herein are provided. In some embodiments, the pluripotent or totipotent cell is derived from a non-human animal. In some embodiments, the non-human animal is a rodent, e.g., a mouse, a rat, or a hamster. In some embodiments, the rodent is a mouse. In specific embodiments, the pluripotent cell is an embryonic stem (ES) cell. In some embodiments, the pluripotent cell comprises in its genome: (i) an immunoglobulin light chain locus that comprises a rearranged human or non-human light chain variable region nucleic acid sequence (or a limited number of human or non-human V_(L) gene segments, e.g., two or more but less than the number of wild type human or non-human V_(L) gene segments) operably linked to a light chain constant region nucleic acid sequence; and (ii) an immunoglobulin heavy chain locus (e.g., hybrid immunoglobulin chain locus) comprising one or more unrearranged human immunoglobulin V_(L) and J_(L) gene segments, operably linked to a heavy chain constant region nucleic acid sequence comprising one or more heavy chain constant region genes, each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising an intact Igμ gene, an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and/or an intact Igα gene. In some embodiments, the heavy chain constant region gene sequence comprises an intact Igμ gene, an intact Igδ gene, an intact Igγ gene, an intact Igα gene, and/or an intact Igε gene. In specific embodiments, the pluripotent, induced pluripotent, or totipotent stem cells are mouse or rat embryonic stem (ES) cells. In some embodiments, the pluripotent, induced pluripotent, or totipotent stem cells have an XX karyotype or an XY karyotype.

Cells that comprise a nucleus containing a genetic modification as described herein are also provided, e.g., a modification introduced into a cell by pronuclear injection. In another aspect, a hybridoma or quadroma is provided, derived from a cell of the non-human animal as described herein. In some embodiments, the non-human animal is a rodent, such as a mouse, a rat, or a hamster.

In another aspect, a lymphocyte isolated from a genetically modified non-human animal as described herein is provided. In some embodiments, the lymphocyte is a B cell, wherein the B cell comprises an immunoglobulin genomic locus comprising a rearranged human or non-human immunoglobulin light chain variable region nucleotide sequence (or a limited number of human or non-human V_(L) gene segments) operably linked to a human or a non-human animal (e.g., mouse or rat) light chain constant region gene sequence. In some embodiments, the B cell further comprises an immunoglobulin genomic locus comprising a rearranged human immunoglobulin light chain variable region nucleotide sequence operably linked to a human or non-human animal (e.g., mouse or rat) heavy chain constant region gene sequence comprising one or more heavy chain constant region genes, each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising an intact Igμ gene, an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and/or an intact Igα gene. In some embodiments, the B cell is capable of producing antibodies wherein the rearranged light chain variable domain as described herein is operably linked to a heavy chain or light chain constant domain.

In another aspect, a non-human animal embryo comprises a cell whose genome comprises: (i) an immunoglobulin heavy chain locus (e.g., hybrid immunoglobulin chain locus) comprising unrearranged human light chain variable region gene segments operably linked to a constant region nucleic acid sequence comprising one or more heavy chain constant region genes, each one comprising a sequence encoding a functional C_(H)1 domain, e.g., comprising an intact Igμ gene, an intact Igδ gene, an intact Igγ gene, an intact Igε gene, and/or an intact Igα gene; and (ii) an immunoglobulin light chain locus comprising a rearranged human or non-human immunoglobulin light chain variable region nucleotide sequence (or a limited number of human or non-human V_(L) gene segments) operably linked to a light chain constant region nucleic acid sequence. In some embodiments, the hybrid immunoglobulin locus comprising unrearranged human light chain variable region gene segments operably linked to a constant region nucleic acid sequence is operably linked to a heavy chain constant region nucleic acid sequence, and the heavy chain constant region gene sequence comprises an intact Igμ gene, an intact Igδ gene, an intact Igγ gene, an intact Igα gene, and/or an intact Igε gene.

In various embodiments, the genetically modified non-human animals express an antibody repertoire (e.g., an IgG repertoire) that is derived from the nucleotide sequence that encodes the rearranged light chain variable domain (or the nucleotide sequence that contains a limited number of human V_(L) gene segments), and a plurality of light chain V segments (and a plurality of light chain J segments). In some embodiments, the genetically modified locus produces an antibody population that comprises an immunoglobulin light chain that is capable of specifically binding an antigen of interest with an affinity (K_(D)) lower than 10⁻⁶, 10⁻⁷, 10⁻⁸, 10⁻⁹ or 10⁻¹⁰. In some embodiments, the immunoglobulin light chain expressed by the genetically modified locus is capable of specifically binding an antigen of interest in the absence of a heavy chain variable region with an affinity (K_(D)) lower than 10⁻⁶, 10⁻⁷, 10⁻⁸, 10⁻⁹, or 10⁻¹⁹.

In various embodiments, the genetic modifications described herein do not affect fertility of the non-human animal (see, for example, U.S. Pat. No. 8,642,835, incorporated by reference in its entirety). In some embodiments, the heavy chain locus, e.g., hybrid chain locus, comprises an endogenous Adam6a gene, Adam6b gene, or both, and the genetic modification does not affect the expression and/or function of the endogenous Adam6a gene, Adam6b gene, or both. In some embodiments, the genome of the genetically modified non-human animal comprises an Adam6a gene, Adam6b gene, or both integrated in the genome at location outside the heavy chain locus or hybrid chain locus. In some embodiments, an Adam6a and/or Adam6b gene is placed 5′ upstream of the unrearranged light chain variable region gene segments. In some embodiments, the Adam6a and/or the Adam6b gene is placed 3′ downstream of the unrearranged light chain variable region gene segments. In some embodiments, the heavy chain locus comprises a functional ectopic mouse Adam6 gene.

The capabilities of the genetically modified non-human animals described herein to apply selective pressure to genes or polynucleotides encoding light chain variable regions or domains (e.g., light chain CDR3s) can be applied to a variety of variable light chain gene sequences. In other words, the rearranged light chain variable region nucleotide sequences disclosed herein can be paired with one or more genetic modifications of a heavy chain locus and/or the insertion of nucleotide sequences encoding light chain variable domains into a heavy chain locus. This can be accomplished by, for example, mating (i.e., cross-breeding or intercrossing of animals with single modification) the non-human animals described herein (restricted to a common or universal light chain variable domain) with non-human animals comprising genetic modifications within one or more heavy chain-encoding loci. Genetically modified non-human animals comprising immunoglobulin light chain loci with a rearranged light chain variable region nucleotide sequence (or a limited number of human V_(L) gene segments) and one or more heavy chain loci modifications can also be generated by targeted gene replacement of multiple loci, either simultaneously or sequentially (e.g., by sequential recombination in embryonic stem cells). Neither the type nor method of modification at the heavy chain loci limits embodiments described herein unless specifically noted. Rather, the selective pressure facilitated by embodiments described herein can be applied to virtually any polynucleotide sequence capable of being expressed and functioning as a heavy chain antigen-binding sequence, thereby driving the evolution of fitter antibody variable regions.

For example, as described herein, genetically modified non-human animals comprising an immunoglobulin locus with a rearranged light chain variable region nucleotide sequence (or a limited number of human V_(L) gene segments) may further comprise (e.g., via cross-breeding or multiple gene targeting strategies) one or more modifications as described in WO 2011/072204, WO 2011/163311, WO 2011/163314, WO 2012/018764, WO 2012/141798, WO 2013/022782, WO 2013/059230, WO 2013/096142, WO 2013/116609, WO 2013/187953; these publications are incorporated herein by reference in their entirety. In particular embodiments, a genetically modified mouse comprising a rearranged light chain variable region nucleic acid sequence, or a limited number of V_(L) gene segments, in a light chain locus (e.g., a rearranged light chain variable domain gene sequence, or two V_(L) gene segments, operably linked to a human or non-human κ light chain constant region gene sequence) is crossed to a genetically modified mouse comprising an immunoglobulin heavy chain locus (e.g., hybrid immunoglobulin chain locus) comprising human light chain variable region gene segments (e.g., 40 human Vκ genes and all human Jκ genes inserted into a mouse heavy chain locus; see, e.g., U.S. Patent Application Publication no. 2012-0096572 A1, incorporated herein by reference). In specific embodiments, a genetically modified mouse comprising a rearranged light chain variable region nucleic acid sequence, or a limited number of V_(L) gene segments, in a light chain locus (e.g., a rearranged light chain variable region nucleotide sequence, or two V_(L) gene segments, operably linked to a human or non-human κ light chain constant region gene sequence) is crossed to a genetically modified mouse comprising an immunoglobulin heavy chain locus (e.g., hybrid immunoglobulin chain locus) comprising one or more human light chain variable region gene segments. The resulting mice are able to produce Igκ⁺ B cells with variable heavy chains derived from genomic light chain variable sequences, thus facilitating the identification of Vκ domains that bind to specific targets.

EXAMPLES

The following non-limiting examples are set forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how to make and use non-human animals described herein and aid in the understanding thereof, and are not intended to limit the scope of what the inventors regard as their invention nor are they intended to represent that the experiments below are all or the only experiments performed. The Examples do not include detailed descriptions of conventional methods that would be well known to those of ordinary skill in the art (molecular cloning techniques, etc.). Efforts have been made to ensure accuracy with respect to numbers used (e.g. amounts, temperature, etc.) but some experimental errors and deviations should be accounted for. Unless indicated otherwise, parts are parts by weight, molecular weight is weight average molecular weight, temperature is in degrees Centigrade, and pressure is at or near atmospheric.

Example 1. Generation of Non-Human Animals Having Modified Immunoglobulin Loci

This example illustrates exemplary methods of engineering immunoglobulin heavy chain loci of non-human animals to contain (a) an immunoglobulin heavy chain locus comprising unrearranged human inununoglobulin light chain V_(L) and J_(L) gene segments operably linked to an immunoglobulin heavy chain constant region nucleic acid sequence (e.g., hybrid immunoglobulin chain locus); and (b) an immunoglobulin light chain locus comprising a rearranged human immunoglobulin light chain variable region nucleotide sequence operably linked to an immunoglobulin light chain constant region nucleic acid sequence.

Construction of Immunoglobulin Heavy Chain Loci with Light Chain Gene Segments

Wild type mouse heavy chain and human κ light chain loci are depicted in FIG. 1. Construction of exemplary targeting vectors for the insertion of human light chain V and J gene segments (e.g., Vκ and Jκ) into a murine immunoglobulin heavy chain locus is described below. FIG. 2 illustrates four exemplary targeting vectors that contain a plurality of human κ light chain gene segments for insertion into a murine immunoglobulin heavy chain locus using homologous recombination.

Various targeting constructs were made using VELOCIGENE® genetic engineering technology (see, e.g., U.S. Pat. No. 6,586,251 and Valenzuela, D. M., Murphy, A. J., Frendewey, D., Gale, N. W., Economides, A. N., Auerbach, W., Poueymirou, W. T., Adams, N. C., Rojas, J., Yasenchak, J., Chernomorsky, R., Boucher, M., Elsasser, A. L., Esau, L., Zheng, J., Griffiths, J. A., Wang, X., Su, H., Xue, Y., Dominguez, M. G., Noguera, I., Torres, R., Macdonald, L. E., Stewart, A. F., DeChiara, T. M., Yancopoulos, G. D. (2003). High-throughput engineering of the mouse genome coupled with high-resolution expression analysis. Nat Biotechnol 21, 652-659) to modify mouse genomic Bacterial Artificial Chromosome (BAC) libraries. Mouse BAC DNA may be modified by homologous recombination to deletion the endogenous V_(H), D_(H) and J_(H) gene segments for the subsequent insertion of unrearranged human V_(L) and J_(L) gene segments. Alternatively, the endogenous V_(H), D_(H) and J_(H) gene segments may be left intact and inactivated so that recombination of endogenous gene segments to form a functional variable region is inhibited (e.g., by inversion or disruption of gene segments).

Genetically modified mice, and methods of making the same, whose genome contains an immunoglobulin hybrid chain locus comprising unrearranged human immunoglobulin light chain V_(L) and J_(L) gene segments operably linked to an immunoglobulin heavy chain constant region nucleic acid sequence are described in U.S. Patent Application Publication No. 2012-0096572 A1, incorporated herein by reference in its entirety. As shown in FIG. 2, four targeting vectors were engineered to progressively insert 40 human Vκ gene segments and five human Jκ gene segments into an non-human ES cell comprising an inactivated heavy chain locus (e.g., deleted endogenous V_(H), D_(H) and J_(H) gene segments) and/or a light chain locus comprising a single rearranged human V_(L)/J_(L) gene sequence operably linked to a light chain constant region, e.g., a non-human light chain constant region, e.g., at an endogenous non-human light chain locus, using standard molecular techniques recognized in the art. Table 1 sets forth the size of human DNA included in each targeting vector which contains various human κ light chain gene segments for insertion into a mouse immunoglobulin heavy chain locus. Any number of human Vκ and Jκ gene segments may be included in the targeting vectors. The exemplary targeting vectors set forth in FIG. 2 include human κ light chain gene segments that are naturally found in the proximal contig of the germ line human κ light chain locus (FIG. 1). The resulting endogenous heavy chain locus after successive insertion of all four targeting vectors is shown in the bottom of FIG. 2.

TABLE 1 Targeting Size of Human Human κ Gene Segments Added Vector κ Sequence Vκ Jκ 1 ~110.5 kb 4-1, 5-2, 7-3, 2-4, 1-5, 1-6 1-5 3-7, 1-8, 1-9, 2-10, 3-11, 2   ~140 kb 1-12, 1-13, 2-14, 3-15, 1-16 — 1-17, 2-18, 2-19, 3-20, 6-21, 3   ~161 kb 1-22, 1-23, 2-24, 3-25, 2-26, — 1-27, 2-28, 2-29, 2-30 4   ~90 kb 3-31, 1-32, 1-33, 3-34, 1-35, — 2-36, 1-37, 2-38, 1-39, 2-40

Using a similar approach, other combinations of human light chain variable domains in the context of murine heavy chain constant regions may be constructed. Additional light chain variable domains may be derived from human Vλ and Jλ gene segments. Exemplary targeting vectors that include human DNA that include various numbers of human Vλ and Jλ gene segments are set forth in FIG. 3.

The human λ light chain locus extends over 1,000 kb and contains over 80 genes that encode variable (V) or joining (J) segments. Among the 70 Vλ gene segments of the human λ light chain locus, anywhere from 30-38 appear to he functional gene segments according to published reports. The 70 Vλ sequences are arranged in three clusters, all of which contain different members of distinct V gene family groups (clusters A, B and C). Within the human λ light chain locus, over half of all observed Vλ domains are encoded by the gene segments 1-40, 1-44, 2-8, 2-14, and 3-21. There are seven Jλ gene segments, only four of which are regarded as generally functional Jλ gene segments Jλ1, Jλ2, Jλ3, and Jλ7. In some alleles, a fifth Jλ-Cλ gene segment pair is reportedly a pseudo gene (Cλ6). Incorporation of multiple human Jλ gene segments into a hybrid heavy chain locus, as described herein, may be constructed by de novo synthesis. In this way, a genomic fragment containing multiple human Jλ gene segments in germline configuration is engineered with multiple human Vλ gene segments and allows for normal V-J recombination in the context of a heavy chain constant region. An exemplary targeting vector that includes multiple Jλ gene segments is shown in FIG. 3 (Targeting Vector 1′).

Coupling light chain variable domains with heavy chain constant regions represents a potentially rich source of diversity for generating unique V_(L) binding proteins with human V_(L) regions in non-human animals. Exploiting this diversity of the human λ light chain locus (or human κ locus as described above) in mice results in the engineering of unique hybrid heavy chains and gives rise to another dimension of binding proteins to the immune repertoire of genetically modified animals and their subsequent use as a next generation platform for the generation of therapeutics.

The targeting vectors described above are used to electroporate mouse embryonic stem (ES) cells to created modified ES cells for generating chimeric mice that express V_(L) binding proteins (i.e., human light chain gene segments operably linked to mouse heavy chain constant regions). ES cells containing an insertion of unrearranged human light chain gene segments are identified by the quantitative PCR assay, TAQMAN® (Lie and Petropoulos, 1998. Curr. Opin. Biotechnology 9:43-48). Specific primers sets and probes are designed for insertion of human sequences and associated selection cassettes, loss of mouse heavy chain sequences and retention of mouse sequences flanking the endogenous heavy chain locus.

ES cells bearing the human light chain gene segments (e.g., Vκ and Jκ) operably linked to a heavy chain constant region sequence can be transfected with a construct that expresses a recombinase in order to remove any undesired selection cassette introduced by the insertion of the human light chain gene segments. Optionally, the selection cassette may be removed by breeding to mice that express the recombinase (e.g., U.S. Pat. No. 6,774,279). Optionally, the selection cassette is retained in the mice.

Targeted ES cells described above are used as donor ES cells and introduced into an 8-cell stage mouse embryo by the VELOCIMOUSE® method (see, e.g., U.S. Pat. No. 7,294,754 and Poueymirou, W. T., Auerbach, W., Frendewey, D., Hickey, J. F., Escaravage, J. M., Esau, L., Dore, A. T., Stevens, S., Adams, N. C., Dominguez, M. G., Gale, N. W., Yancopoulos, G. D., DeChiara, T. M., Valenzuela, D. M. (2007). F0 generation mice fully derived from gene-targeted embryonic stem cells allowing immediate phenotypic analyses. Nat Biotechnol 25, 91-99). VELOCIMICE® (F0 mice fully derived from the donor ES cell) independently bearing human light chain gene segments at a mouse immunoglobulin heavy chain locus are identified by genotyping using a modification of allele assay (Valenzuela et al., supra) that detects the presence of the unique human light chain gene segments at an endogenous immunoglobulin heavy chain locus. Pups are genotyped and a pup heterozygous or homozygous for the genetically modified immunoglobulin heavy chain locus is selected for characterizing expression of V_(L)-containing heavy chains.

Mice whose genome comprises an immunoglobulin heavy chain allele that contains an insertion of forty (40) unrearranged human Vκ and five (5) Jκ gene segments into an endogenous locus so that said human Vκ and Jκ gene segments are operably linked to endogenous heavy chain constant regions are referred to as MAID1713 (see U.S. Patent Application Publication no. 2012-0096572 A1, incorporated herein by reference in its entirety). Mice having the same and also an integrated mouse Adam6 gene are referred to as MAID1994 (see U.S. Patent Application Publication no. 2013-0212719 A1, herein incorporated by reference in its entirety).

Construction of Immunoglobulin Light Chain Loci with a Rearranged Human Light Chain Nucleotide Sequence

Construction of exemplary targeting vectors for the insertion of a single rearranged human light chain nucleotide sequence (e.g., a single human rearranged V_(L)/J_(L) nucleotide sequence) into a murine immunoglobulin light chain locus are described below. FIG. 4 illustrates a targeting vector that contains a single rearranged human light chain nucleotide sequence for insertion into a murine immunoglobulin light chain locus using homologous recombination.

Genetically modified mice, and methods of making the same, whose genome contains an immunoglobulin light chain locus comprising a rearranged human immunoglobulin light chain variable region nucleotide sequence operably linked to an immunoglobulin light chain constant region nucleic acid sequence are described in U.S. Patent Application Publication No. US 2011-0195454A1, incorporated herein by reference in its entirety. As shown in FIG. 4, a targeting vector was engineered to contain a single rearranged human light chain (i.e., a rearranged human V_(L)/J_(L)) nucleotide sequence for insertion into an ES cell comprising an inactivated mouse κ light chain locus (e.g., deleted endogenous Vκ and Jκ gene segments) and, optionally, a hybrid immunoglobulin locus, using standard molecular techniques recognized in the art. The single rearranged human light chain nucleotide sequence may include any human V_(L) and human J_(L) sequence. Suitable exemplary rearranged human light chain nucleotide sequences that can be employed include those derived from a rearranged human Vκ1-39Jκ5 nucleotide sequence (MAID1633, FIG. 5), a rearranged human Vκ3-20Jκ1 nucleotide sequence (MAID1635, FIG. 5).

Alternatively, as described above, in some embodiments, a mouse may also be engineered to comprise an insertion of human Vλ and Jλ gene segments into an endogenous immunoglobulin heavy chain locus so that said human Vλ and Jλ gene segments are operably linked to heavy chain constant regions. In such embodiments, to achieve optimal expression and usage of the inserted human Vλ and gene segments, those skilled in the art are aware that one might use a rearranged sequence such as a rearranged human VλJλ nucleotide sequence. Such rearranged human VλJλ nucleotide sequence would provide a better ability of the rearranged human VλJλ sequences in the context of a heavy chain constant region to pair with the rearranged human VλJλ sequence in the context of a light chain constant region. Rearranged human VκJκ sequences in the context of heavy chain constant regions may not be able to effectively associate with rearranged VλJλ sequences in the context of light chain constant regions (see US 2012-0096572 A1). Therefore, an exemplary rearranged human VλJλ sequence includes a rearranged human Vλ2-14Jλ1 nucleotide sequence.

The targeting vector described above is used to electroporate mouse embryonic stem (ES) cells, which may optionally comprise a hybrid immunoglobulin locus, to create modified ES cells for generating chimeric mice that express light chains encoded by a single rearranged human light chain nucleotide sequence (i.e., a single human VOL nucleotide sequence operably linked to mouse light chain constant regions). ES cells containing an insertion of a single rearranged human light chain nucleotide sequence is identified by the quantitative PCR assay, TAQMAN® (Lie and Petropoulos, 1998. Curr. Opin. Biotechnology 9:43-48). Specific primers sets and probes are designed for insertion of the single rearranged human light chain nucleotide sequence and associated selection cassettes, loss of mouse light chain sequences and retention of mouse sequences flanking an endogenous light chain locus.

ES cells bearing the single rearranged human light chain nucleotide sequence can be transfected with a construct that expresses a recombinase in order to remove any undesired selection cassette introduced by the insertion of the single rearranged human light chain nucleotide sequence. Optionally, the selection cassette may be removed by breeding to mice that express the recombinase (e.g., U.S. Pat. No. 6,774,279). Optionally, the selection cassette is retained in the mice.

Targeted ES cells described above are used as donor ES cells and introduced into an 8-cell stage mouse embryo by the VELOCIMOUSE® method (see, e.g., U.S. Pat. No. 7,294,754 and Poueymirou, W. T., Auerbach, W., Frendewey, D., Hickey, J. F., Escaravage, J. M., Esau, L., Dore, A. T., Stevens, S., Adams, N. C., Dominguez, M. G., Gale, N. W., Yancopoulos, G. D., DeChiara, T. M., Valenzuela, D. M. (2007). F0 generation mice fully derived from gene-targeted embryonic stem cells allowing immediate phenotypic analyses. Nat Biotechnol 25, 91-99). VELOCIMICE® (F0 mice fully derived from the donor ES cell) independently bearing a single rearranged human light chain nucleotide sequence at a mouse immunoglobulin light chain locus are identified by genotyping using a modification of allele assay (Valenzuela et al., supra) that detects the presence of the unique rearranged human light chain nucleotide sequence at an endogenous immunoglobulin light chain locus. Pups are genotyped and a pup heterozygous or homozygous for the genetically modified immunoglobulin light chain locus is selected for characterizing expression of the single human light chain.

Example 2. Characterization of Mice Comprising a Single Rearranged Human Immunoglobulin Light Chain Nucleotide Sequence and a Plurality of Human κ Light Chain Gene Segments

Mice comprising a rearranged light chain variable region nucleic acid sequence in a light chain locus (ULC Mouse: MAID1633, single rearranged human Vκ1-39/Jκ5 or MAID1635, single rearranged human Vκ3-20Jκ1) were generated as described above. Briefly, in the ULC mouse, all endogenous functional light chain variable gene segments were deleted and replaced with a single rearranged light chain variable region nucleic acid sequence (e.g., a sequence that encodes a human Vκ1-39/Jκ5 or a human Vκ3-20Jκ1), which is operably linked to an endogenous light chain constant region nucleic acid sequence.

Mice comprising genetically engineered heavy chain loci containing unrearranged human immunoglobulin light chain V_(L) and J_(L) gene segments in a heavy chain locus (KOH Mouse: MAID1713: 40 human Vκ gene segments and five human Jκ gene segments; MAID1994: 40 human Vκ gene segments and five human Jκ gene segments, and an integrated Adam6 gene) were generated as described above. Briefly, in the KOH Mouse, all endogenous functional heavy chain variable gene segments were deleted and replaced with 40 unrearranged human Vκ gene segments and five (5) unrearranged human Jκ gene segments, which are operably linked to an immunoglobulin heavy chain constant region nucleic acid sequence.

Homozygous ULC mice (MAID1633 or MAID 1635) described above were bred to homozygous KOH mice (MAID1713 or MAID 1994) mice to produce a mouse heterozygous for the ULC allele and the KOH allele. F1 heterozygous mice generated from this cross were bred to each other to obtain mice homozygous for each allele (MAID1713HO 1633HO, MAID1713HO 1635HO, MAID1994HO 1633HO, or MAID1994HO 1635HO; “KOH×ULC”). Such mice express V_(L) binding proteins that have a structure that resembles that of immunoglobulins, but yet are distinct in that such binding proteins lack heavy chain variable domains. The presence of the genetically modified alleles in the immunoglobulin heavy chain and light chain loci was confirmed by TAQMAN™ screening and karyotyping using specific probes and primers described above. The homozygous KOH×ULC mice comprise an insertion of unrearranged human light chain gene segments as described herein (e.g., human Vκ and Jκ) into the mouse heavy chain locus in which all endogenous variable heavy chain VDJ gene segments have been deleted and an insertion of a single rearranged human light chain variable region nucleotide sequence (MAID1633: rearranged human Vκ1-39.110; MAID1635: rearranged human Vκ3-20Jκ1) into the mouse kappa (κ) light chain locus in which all mouse Vκ and Jκ genes have been deleted (FIG. 6). In some embodiments, KOH×ULC mouse further comprise an integrated Adam6 gene.

Alternatively, to generate mice comprising both ULC allele and KOH allele, ES cells harboring a ULC modification or ES cells harboring a KOH modification are targeted with KOH or ULC targeting vector, respectively. Mice are generated from ES cells harboring both modifications by introducing ES cells into an 8 stage mouse embryo by VELOCIMMUNE® method and screening as described above in Example 3. F1 heterozygous mice are bred to obtain homozygous mice.

All mice were housed and bred in specific pathogen-free conditions at Regeneron Pharmaceuticals, Inc. Three KOH (MAID1994HO 1242HO; see U.S. Patent Application Publication No. US 2013-0212719 A1, incorporated by reference herein) mice (˜11 weeks old, male) and two groups of three KOH×ULC (MAID1994HO 1633HO, 12 weeks old, female; MAID1994HO 1635HO, 11 weeks old, 2 male and 1 female) mice were sacrificed, and spleens and bone marrow were harvested from the animals. Bone marrow was collected from femurs by flushing with complete RPMI medium (RPMI medium supplemented with fetal calf serum, sodium pyruvate, Hepes, 2-mercaptoethanol, non-essential amino acids, and gentamycin). Red blood cells from spleen and bone marrow preparations were lysed with ACK lysis buffer and washed with complete RPMI medium.

Flow Cytometry

In order to examine the ability of the genetically modified homozygous “KOH×ULC” (MAID1994HO 1633HO and MAID1994HO 1635HO) mice described herein to produce V_(L) binding proteins derived from the genetically modified alleles (e.g., from the allele that contains a single copy of the rearranged human light chain nucleotide sequence in the light chain locus and the allele that contains unrearranged human Vκ and Jκ gene segments in the heavy chain locus), fluorescence-activated cell sorting (FACS) analysis was performed. KOH mice comprising an unrearranged light chain locus comprising unrearranged human V_(L) and J_(L) gene segments (1994 HO 1242 HO), as well as VELOCIMMUNE® mice comprising unrearranged human heavy and light chain gene segments on mouse heavy and light chain loci, respectively (VI3) were used as controls.

Briefly, 1×10⁶ cells were incubated with anti-mouse CD16/CD32 (clone 2.4G2, BD Pharmigen) on ice for 10 minutes, followed by labeling with the following antibody cocktail for 30 minutes on ice: APC-H7 conjugated anti-mouse CD19 (clone 1D3, BD Pharmigen), Pacific Blue conjugated anti-mouse CD3 (clone 17A2, BioLegend), FITC conjugated anti-mouse Igκ (clone 187.1, BD Pharmigen) or anti-mouse CD43 (clone 1B11, BioLegend), PE conjugated anti-mouse Igλ (clone RML-42, BioLegend) or anti-mouse c-kit (clone 2B8, BioLegend), PerCP-Cy5.5 conjugated anti-mouse IgD (BioLegend), PE-Cy7 conjugated anti-mouse IgM (clone II/41, eBioscience), APC conjugated anti-mouse B220 (clone RA3-6B2, eBioscience). Following staining, cells were washed and fixed in 2% formaldehyde. Data acquisition was performed on an LSR11 flow cytometer and analyzed with FlowJo (Tree Star, Inc.). Gating: total B cells (CD19⁺CD3⁻), B cells (Igκ⁺Igλ⁻ CD19⁺CD3⁻), Igλ⁺ B cells (Igκ⁻Igλ⁺CD19⁺CD3⁻). Results for the bone marrow compartment are shown in FIGS. 7-10. Results for the splenic compartment are shown in FIGS. 11-15.

Only mature B lymphocytes can enter the lymphoid follicles of spleen and lymph nodes and thus efficiently participate in the immune response. Mature, long-lived B lymphocytes derive from short-lived precursors generated in the bone marrow. Selection into the mature pool is an active process and takes place in the spleen. Two populations of splenic B cells have been identified as precursors for mature B cells. Transitional B cells of type 1 (T1) are recent immigrants from the bone marrow. They develop into the transitional B cells of type 2 (T2), which are cycling and found exclusively in the primary follicles of the spleen. Mature B cells can be generated from T1 or T2 B cells. Loder, F. et al., J. Exp. Med., 190(1): 75-89, 1999.

The FACS analysis (FIGS. 7-15) suggested that the KOH×ULC mice were able to produce nearly normal B cell populations in the bone marrow compartment (FIGS. 7-8). Interestingly, KOH×ULC mice demonstrate a lack of lambda (λ) expression in the bone marrow (FIG. 10).

In the splenic compartment, KOH×ULC mice produced nearly normal B cell populations (FIGS. 11, 12, and 14). As in the hone marrow compartment, KOH×ULC mice demonstrated a lack of lambda (λ) expression in the spleen (FIGS. 11 and 12). Also in the splenic compartment, KOH×ULC mice demonstrated nearly normal transitional and mature B cell populations as compared to VELOCIMMUNE® (VI3) mice (FIGS. 13-15).

Taken together, these data show that the KOH×ULC mice provided by the present invention, such as those with the genetic modifications described in Example 1, are healthy and demonstrate a near wild-type B cell development. Moreover, such mice express binding proteins that resemble natural antibodies in structure, but yet lack heavy chain variable region sequences.

Finally, as depicted in FIG. 16, mice comprising the genetic modifications described herein were capable of generating antigen-specific titers when immunized with Antigen 1 (a cell surface receptor).

Example 3. Antigen-Binding Characterization of V_(L/CH×ULC) Domains from KOH×ULC Mice

This example illustrates exemplary methods of obtaining nucleic acid sequences that encode an immunoglobulin light chain variable domain (V_(L/CH×ULC)) that can detectably bind an antigen independently from a cognate variable domain, e.g., a cognate universal light chain variable domain. Exemplary V_(L/CH×ULC) domains that detectably bind an antigen independently from a cognate variable domain are obtained from genetically modified non-human animals (e.g., mice) whose genome includes an immunoglobulin heavy chain locus (hybrid immunoglobulin chain locus) containing unrearranged human light chain gene segments (e.g., V_(L) and J_(L) gene segments) operably linked to a heavy chain constant region sequence and an immunoglobulin light chain locus containing a rearranged immunoglobulin light chain variable sequence (i.e., a universal or common light chain variable region) operably linked to a light chain constant region sequence. Such non-human animals express binding proteins that contain immunoglobulin light chain V_(L/CH×ULC) variable domains operably linked to a heavy chain constant regions and common immunoglobulin light chain variable domains operably linked to a light chain constant regions, wherein the V_(L/CH×ULC) light chains are derived from the unrearranged human light chain gene segments, and wherein the common light chain variable domains are encoded by the single rearranged light chain variable gene sequence.

Preparation of a V L/CH×ULC, specifically a V_(κOH×ULC) immunoglobulin light chain variable domain, that retains antigen binding when paired with an unrelated, e.g., noncognate, human V_(H) domain was performed. KOH×ULC mice were immunized with a cell surface protein (Antigen 2). Antigen positive B-cells were sorted from two KOH×ULC mice; MA1D1712 1635 (KOH×ULC:Vκ3-20Jκ1). Cells were sorted based on Antigen 2 and 1536 B-cells were collected. 384 B-cells were processed from the “best” mouse as judged during sorting. 176 KOH V_(L) domains, e.g., V_(κ/CH×ULC) domains were cloned into Fab plasmids. Individual sequences encoding one of 176 KOH VL domains were cloned into Fab plasmids along with a sequence encoding a human Vκ3-20 germline ULC sequence. Each sequence encoding a KOH VL (V_(κOH×ULC)) domain was cloned operably linked with a heavy chain constant region sequence (i.e. CH1) and the ULC sequence was closed operably linked with a light chain constant κ gene sequence. Transient transfections were carried out to produce protein for Ag+ screening. Screening for Antigen 2 binding was assayed by ELISA and BIACORE™. Fourteen (14) samples bound Antigen 2 at neutral pH as determined by ELISA, as shown in FIG. 17. Binding was confirmed by BIACORE™ for 13 of the 14 ELISA binders.

Subsequently, two KOH derived V_(L) (V_(κOH×ULC)) domains were chosen and independently cloned and reformatted with light chain constant regions (i.e., Cκ). Each of the reformatted KOH V_(L)/Cκ chains were independently paired with non-cognate V_(H) domain formatted with a heavy chain C_(H) to form a typical antibody structure. Notably, the non-cognate V_(H) domain was generated in a mouse that was genetically modified to generate all V_(H) domains from a single rearranged heavy chain variable region sequence, see, e.g., U.S. Patent Publication No. 20140245468, incorporated herein in its entirety by reference, and immunized with an unrelated enzyme (Antigen 3). The reformatted V_(L) domains were tested for Antigen 2 binding by BIACORE™. Results are shown in FIG. 18.

In FIG. 18, antibodies A and B each comprise an immunoglobulin light chain comprising a distinct KOH V_(L) (V_(κOH×ULC)) fused with a Cκ constant domain and an immunoglobulin heavy chain comprising a V_(H) domain fused with an intact C_(H) domain. In contrast, while antibody C comprises the same immunoglobulin heavy chain as antibodies A and B, antibody C comprises an immunoglobulin light chain comprising a V_(L) domain that is cognate to the V_(H) domain fused with a Cκ domain.

The results show that when a V_(H) domain derived from a single rearranged heavy chain variable region and raised against Antigen 3 is paired with cognate light chain variable domains, antigen binding to Antigen 3 is maintained (see, FIG. 18; showing antibody C binds to Antigen 3 as expected). Not surprising, when the same V_(H) domain was paired with a noncognate KOH V_(L) domain, Antigen 3 binding was undetectable (FIG. 18).

In contrast, Antigen 2 binding was maintained for both KOH V_(L) (V_(κOH×ULC)) domains (i.e., antibody A and B V_(L) domains) despite being (1) reformatted onto a Cκ domain and (2) paired with a non-cognate V_(H) domain.

The results suggest that KOH antibodies isolated from KOH×ULC mice can bind antigen solely through one V_(L) domain (i.e., a Vκ domain). This is confirmed by reformatting a KOH V_(L) (V_(κOH×ULC)) domain onto a light chain backbone (i.e., a Cκ region) and pairing with a V_(H) domain raised against a different antigen. Such a molecule was shown to retain binding to the antigen to which the parental KOH antibody (i.e., V_(κOH×ULC) domain) was raised.

Taken together, this Example demonstrates that KOH×ULC mice provide a robust in vivo system to select for “antibody-like” molecules that bind antigen solely through a V_(L/CH×ULC) domain (e.g., Vκ_(/CH×ULC)), i.e., independent of a cognate variable domain. Such mice provide the opportunity to select V_(L) domains (Vκ_(/CH×ULC) or Vλ_(/CH×ULC)) that bind antigen in the absence of a cognate variable domain and/or when paired with a noncognate variable domain. The V_(L) binding proteins expressed by the mice described herein may provide a novel paratope or binding surface to targets that evolve to avoid conventional antibodies (e.g., HIV and influenza).

Example 4. Making a Multi-Specific Antigen Binding Protein Comprising a V_(L/CH×ULC) Domain

This example illustrates an exemplary method of making a multi-specific antigen-binding protein comprising a light chain variable V_(L/CH×ULC) domain derived from an immunoglobulin hybrid chain that is cognate with a universal light chain. As described in Example 3, a first nucleic acid sequence encoding a KOH V_(L) domain, e.g., a V_(κ/CH×ULC), is isolated from a non-human animal genetically modified to comprise in its genome an immunoglobulin hybrid chain locus containing unrearranged human light chain gene segments (e.g., V_(κ) and J_(κ) gene segments) operably linked to a heavy chain constant region sequence and an immunoglobulin light chain locus containing a rearranged immunoglobulin light chain variable sequence (i.e., a universal or common light chain variable region) operably linked to a light chain constant region sequence. A second nucleic acid encoding a second V_(L/CH×ULC) domain may also be isolated from a non-human animal genetically modified to comprise in its genome an immunoglobulin hybrid chain locus containing unrearranged human light chain gene segments (e.g., V_(L) and J_(L) gene segments) operably linked to a heavy chain constant region sequence and an immunoglobulin light chain locus containing a rearranged immunoglobulin light chain variable sequence (i.e., a universal or common light chain variable region) operably linked to a light chain constant region sequence. Alternatively, a second nucleic acid encoding a heavy chain variable V_(H×ULC) domain that binds the second antigen and is cognate to a universal light chain may be isolated from a non-human animal genetically modified with a universal light chain (“ULC”), see, e.g., 2011-0195454 A1, US 2012-0021409A1, US 2012-0192300A1, US 2013-0045492A1, US 2013-0185821A1 and US 2013-0302836A1, incorporated by reference herein in their entireties) or a restricted (limited) immunoglobulin light chain variable region gene segment repertoire (e.g., a restricted immunoglobulin light chain variable segment repertoire comprising two or more but less than the wild type number of human V_(L) gene segments; for example, a dual light chain, or “DLC”, U.S. Patent Application Publication No. US-2013-0198880-A1, incorporated by reference herein in its entirety).

A first binding component encoded by a nucleic acid comprising the first nucleic acid sequence encoding the first a V_(L/CH×ULC) domain that binds the first antigen may be co-expressed in a cell with a second binding component encoded by the second nucleic acid comprising a nucleic acid sequence encoding the second variable V_(L/CH×ULC) domain or V_(H/CH×ULC) domain that binds the second antigen such that the first and second binding components are expressed as a multi-specific, e.g., a bi-specific antigen-binding protein. Exemplary pairing formats include the first and second binding components respectively pairing in an Fv format, an scFv format, a Fab format, an scFab format, a tetrameric antibody format wherein the first and second binding components are each heavy chains comprising a functional C_(H)1 domain associated with a universal light chain, or a tetrameric antibody format wherein one of the first or second binding components is a heavy chains comprising a functional C_(H)1 domain and is associated with the other of the first or second binding component as a light chain

KOH×ULC mice comprising unrearranged human light chain variable region gene segments were immunized with Antigen A, a multivalent high molecular weight protein, to form V_(κ/CH×ULC) variable domains specific for Antigen A. ULC mice comprising unrearranged human heavy chain variable region gene segments as described in e.g., 2011-0195454 A1, US 2012-0021409A1, US 2012-0192300A1, US 2013-0045492A1, US 2013-0185821A1 and US 2013-0302836A1, incorporated by reference herein in their entireties, were immunized with Antigen B, a monomeric lower molecular weight protein, to form V_(H×ULC) variable domains specific for antigen B.

B-cells expressing antigen-binding proteins capable of binding Antigen A or Antigen B were respectively sorted from KOH×ULC or ULC mice as described in U.S. Pat. No. 7,582,298, incorporated herein by reference. Both KOH×ULC and ULC mice used in this study were genetically modified with a ULC encoded by a rearranged immunoglobulin light chain comprising a human Vκ3-20 gene segment rearranged with a human Jκ1 gene segment.

Briefly, red blood cells were removed by lysis followed by pelleting the harvested splenocytes. Resuspended splenocytes were first incubated with a cocktail of human IgG, FITC-anti-mFc, and Antigen A labeled with biotin or Antigen B labeled with biotin (as appropriate) for 1 hour. The stained cells were washed twice with PBS, then stained with a cocktail of human and rat IgG, APC-anti-mIgM, and SA-PE for one hour. The stained cells were washed once with PBS and were analyzed by flow cytometry on a Reflection (Sony). Each IgG positive, IgM negative, and antigen positive B cell was sorted and plated into a separate well on a 384-well plate. RT-PCR of antibody genes from these B cells was performed according to a method described by Wang et al. (2000) (J Immunol Methods 244:217-225).

Briefly, cDNAs for each single B cell were synthesized via reverse transcription (RT). The V_(κOH×ULC) region DNA sequences from Antigen A immunized KOH×ULC mice were amplified by PCR using a 5′ degenerate primer specific for human kappa chain variable region leader sequence and a 3′ primer specific for mouse heavy chain constant region, to form an amplicon. The amplicon was then amplified again by PCR using a 5′ degenerate primer set specific for framework 1 of human kappa variable region sequence and a nested 3′ primer specific for mouse heavy chain constant region. The V_(KOH×ULC) PCR product was cloned into a first Sap I-linearized antibody vector containing human IgG1 heavy chain constant region and an expression cassette for the universal light chain derived from the rearranged Vκ3-201)(1. The heavy chain variable region DNA sequences from Antigen B immunized ULC mice were amplified by PCR using a 5′ degenerate primer specific for human IgG heavy chain variable region leader sequence and a 3′ primer specific for mouse heavy chain constant region, to form an amplicon. The amplicon was then amplified again by PCR using a 5′ degenerate primer set specific for framework 1 of human IgG heavy chain variable region sequence and a nested 3′ primer specific for mouse heavy chain constant region. The V_(H/CH×ULC) PCR products were cloned into a second Sap I-linearized antibody vectors containing a human IgG1 heavy chain constant region.

Purified recombinant plasmid having a rearranged gene encoding the universal light chain derived from the rearranged Vκ3-20Jκ1 sequence operably linked to a human κ constant gene and a V_(L/CH×ULC) sequence operably linked with the human IgG1 constant region sequence, and a purified plasmid having a V_(H/CH×ULC) sequence operably linked with the human IgG1 constant region sequence were combined and transfected into a CHO host cell line. Stably transfected CHO cell pools were isolated after selection with 400 μg/ml hygromycin for 12 days. The CHO cell pools were used to produce the antigen-binding proteins as shown in FIG. 19A.

Equilibrium dissociation constants (K_(D)) for selected antibody supernatants or purified antibodies were determined by SPR (Surface Plasmon Resonance) using a Biacore T200 or 4000 instrument (GE Healthcare). All data was obtained using HBS-EP (10 mM Hepes, 150 mM NaCl, 0.3 mM EDTA, 0.05% Surfactant P20, pH 7.4) as both the running and sample buffers, at 25° C. or 37° C. Antibodies were captured from crude supernatant samples or purified mAbs on a CM4 or CM5 sensor chip surface previously derivatized with a high density of anti-human Fc antibodies using standard amine coupling chemistry. During the capture step, supernatants or purified mAbs were injected across the anti-human Fc surface at a flow rate of 10 μL/min, for a total of 0.5-2.0 minutes. The capture step was followed by an injection of either running buffer or Antigen A at a concentration range from 3.125 nM-100 nM for 1.5-3.0 minutes at a flow rate of 30 pit/min or Antigen B at a concentration range from 0.37 nM-90 nM for for 3.0 minutes. Dissociation of antigens from the captured antibody was monitored for 3.0-5.0 minutes. The captured antibody was removed by a brief injection of 10 mM glycine, pH 1.5. All sensorgrams were double referenced by subtracting sensorgrams from buffer injections from the analyte sensorgrams, thereby removing artifacts caused by dissociation of the antibody from the capture surface. Binding data for each antibody was fitted to a 1:1 binding model with mass transport limitation.

The binding affinities of universal light chain antibodies are shown in FIG. 20, which exhibits K_(D) values in the nanomolar range. Specifically, all bispecific antibodies (B1-B3) comprising a V_(κ/CH×ULC) binding component, a V_(H×ULC) binding component, and universal light chain bound to Antigen A with affinities ranging from 6.8 to 9.6 nM at 25° C. (FIG. 20) and with affinities ranging from 100-140 nM and t_(1/2) values of less than about 1 min at 37° C. (data not shown). The bispecific antibodies also bound to Antigen B with affinities ranging from about 5100 nM at 25° C. (FIG. 20) and with affinities ranging from 174-178 nM at 37° C. (data not shown).

Control monospecific antibodies (antibodies C_(KOH)1-C_(KOH)3), which were raised against Antigen A and included universal light chain variable domains paired with bivalent V_(κCH×ULC) domains, which were respectively cloned to produce the bispecific antibodies B1-B3, bound to Antigen A with affinities ranging from 2-8 nM at 25° C., but not Antigen B (FIG. 20). Without wishing to be bound by theory, it is possible that the differences in the t_(1/2) values at 25° C. observed for Antigen A interactions with the bivalent antibodies (C_(KOH)1-C_(KOH)3) compared to the bispecific antibodies (B1-B3) may be due to the multivalent nature of Antigen A, which may contribute to a predominantly avidity driven interaction. Dissociation constants (t_(in)) were not determined for C_(KOH)1-C_(KOH)3 antibodies at 37° C.

A control monospecific antibody (antibody C_(vn)), which was raised against antigen B and included universal light chain variable domains paired with a bivalent hV_(H×ULC) domain, which was cloned to produce each of bispecific antibodies B1-B3, bound antigen B with an affinity of 5.2 nM at 25° C. and a t_(in), value (41.1 min) that was similar to t_(in), values (23-31 min) with which Antigen B dissociated from the bispecific antibodies (FIG. 20). Binding of C_(VH) to Antigen B at 37° C. was not tested.

An isotype control antibody (C_(I)) did not bind to either antigen A or antigen B (FIG. 20). Another control anti-B antibody (C) in typical antibody format, e.g., having two heavy chains, each comprising a V_(H) domain fused with a C_(H) domain, and two light chains, each having a V_(L) domain fused with a C_(L) domain, bound to antigen B with affinity of 1.4 nM and did not bind to antigen A (FIG. 20).

Taken together, this Example demonstrates that a V_(L/CH×ULC) domain generated in a KOH×ULC non-human animal is capable of binding antigen in a multi-specific format with another variable domain specific for a second distinct epitope.

EQUIVALENTS

Having thus described several aspects of at least one embodiment of this invention, it is to be appreciated by those skilled in the art that various alterations, modifications, and improvements will readily occur to those skilled in the art. Such alterations, modifications, and improvements are intended to be part of this disclosure, and are intended to be within the spirit and scope of the invention. Accordingly, the foregoing description and drawing are by way of example only and the invention is described in detail by the claims that follow.

It should also be understood that any embodiment or aspect of the invention can be explicitly excluded from the claims, regardless of whether the specific exclusion is recited in the specification.

Those skilled in the art will appreciate typical standards of deviation or error attributable to values obtained in assays or other processes described herein.

The publications, websites and other reference materials referenced herein to describe the background of the invention and to provide additional detail regarding its practice are hereby incorporated by reference. 

1. An antigen binding protein comprising a first binding component comprising a first human light chain variable domain that binds a first epitope, wherein the first human light chain variable V_(L/CH×ULC) domain is identical to or derived from a first human light chain variable domain of a first immunoglobulin hybrid chain that has an isotype selected from the group consisting of IgD, IgG, IgE and IgA, comprises a functional C_(H)1 domain, and is cognate with a first universal light chain variable domain derived from a human universal rearranged V_(L)/J_(L) gene sequence, optionally wherein the first V_(L/CH×ULC) domain is fused to either a heavy chain constant domain or a light chain constant domain, and optionally wherein the first V_(L/CH×ULC) domain is associated with a first universal light chain variable domain derived from the human universal rearranged V_(L)/J_(L) gene sequence.
 2. The antigen binding protein of claim 1, further comprising a second binding component comprising a second human variable domain that binds a second epitope, wherein the second human variable domain is either (i) a second human light chain variable V_(L/CH×ULC) domain that is identical to or derived from a second human light chain variable domain of a second hybrid immunoglobulin chain that is cognate with a second universal light chain variable domain derived from the human universal rearranged V_(L)/J_(L) gene sequence, or (ii) a human heavy chain variable V_(H×ULC) domain that is cognate with a second universal light chain variable domain derived from the human universal rearranged V_(L)/J_(L) gene sequence; optionally wherein the second human (V_(L/CH×ULC) or V_(H×ULC)) variable domain is fused to either a heavy chain constant domain or a light chain constant domain, and optionally wherein the second human (V_(L/CH×ULC) or V_(H×ULC)) variable domain is associated with a second universal light chain variable domain derived from the human universal rearranged V_(L)/J_(L) gene sequence.
 3. The antigen-binding protein of claim 2, wherein the first variable V_(L/CH×ULC) and second variable domains are linked by a peptide linker such that the first and second binding components pair in an scFv-like format.
 4. The antigen-binding protein of claim 2, wherein one of the first variable V_(L/CH×ULC) or second variable domains is fused to a functional C_(H)1 domain, wherein the other of first variable V_(L/CH×ULC) or second variable domains is fused to a light chain constant (C_(L)) domain, and wherein the C_(H)1 and C_(L) domains are linked by a peptide linker or a disulfide bond such that the first and second binding components pair in a Fab-like format or an scFab-like format.
 5. The antigen-binding protein of claim 2, wherein the first binding component comprises (a) the first variable V_(l/CH×ULC) domain fused to a first functional C_(H)1 domain, a first hinge region, a first C_(H)2 domain, a first C_(H)3 domain, and optionally a first C_(H)4 domain, and (b) the first universal light chain variable domain fused to a first light chain constant (C_(L)) domain, wherein the first functional C_(H)1 is linked to the first C_(L) domain by a disulfide bond or a peptide linker such that (a) and (b) pair in a Fab-like structure, wherein the second binding component comprises (a′) the second variable domain fused to a second functional C_(H)1 domain, a second hinge region, a second C_(H)2 domain, a second C_(H)3 domain, and optionally a second C_(H)4 domain, and (b′) the second universal light chain variable domain fused to a second light chain constant (C_(L)) domain, wherein the second functional C_(H)1 is linked to the second light chain constant domain by a disulfide bond or a peptide linker such that (a′) and (b′) pair in a Fab-like structure, wherein the first and second binding components pair such that the antigen binding protein forms a tetrameric binding protein, and optionally wherein the first and second C_(H)3 domains have different affinities to Protein A.
 6. The antigen-binding protein of claim 2, wherein the first and second binding components are identical.
 7. The antigen-binding protein of claim 2, wherein the second human variable domain that binds a second epitope is a human heavy chain variable V_(H×ULC) domain.
 8. The antigen-binding protein of claim 2, wherein the second human variable domain that binds a second epitope is a second variable V_(L/CH×ULC) domain, and wherein the first and second epitopes are not identical.
 9. The antigen-binding protein of any one of the preceding claim 1, wherein the human universal rearranged V_(L)/J_(L) gene sequence is a human universal rearranged Vκ/Jκ gene sequence.
 10. The antigen-binding protein of claim 9, wherein the human universal rearranged Vκ/Jκ gene sequence is a human universal rearranged Vκ1-39/Jκ gene sequence or a human universal rearranged human Vκ3-20/Jκ gene sequence.
 11. The antigen-binding protein of claim 10, wherein the human rearranged Vκ1-39/Jκ gene sequence is a rearranged human Vκ1-39/R5 gene sequence or wherein the human rearranged Vκ3-20/Jκ gene sequence is a rearranged human Vκ3-20/R1 gene sequence.
 12. The antigen binding protein of claim 1, wherein all heavy chain and light chain constant regions are respectively human heavy chain and light chain constant regions.
 13. A non-human animal useful for generating a human V_(L/CH×ULC) domain comprising in its germline genome: (i) a hybrid immunoglobulin locus that encodes an immunoglobulin hybrid chain, wherein the hybrid immunoglobulin locus comprises unrearranged human immunoglobulin light chain variable region gene segments (V_(L) and J_(L)) operably linked to an immunoglobulin heavy chain constant region nucleic acid sequence comprising one or more heavy chain constant region genes, each of which encodes at least a functional C_(H)1 domain, wherein the V_(L) and J_(L) gene segments are capable of rearranging to form a hybrid sequence comprising a rearranged human V_(L)/J_(L) gene sequence operably linked to the immunoglobulin heavy chain constant region nucleic acid sequence; (ii) a light chain locus that encodes a human universal light chain and comprises a human universal rearranged light chain variable region nucleotide sequence operably linked to an immunoglobulin light chain constant region nucleic acid sequence; wherein the non-human animal is capable of producing an antigen-binding protein that comprises a human immunoglobulin hybrid chain derived from the hybrid locus and a cognate human universal light chain derived from the light chain locus, wherein the human immunoglobulin hybrid chain comprises a human immunoglobulin light chain variable (hV_(L/CH×ULC)) domain fused to a heavy chain constant IgD, IgG, IgE or IgA region comprising a functional C_(H)1 domain, and wherein the human universal light chain comprises a human immunoglobulin light chain fused to a light chain constant domain. 14.-20. (canceled)
 21. A nucleic acid encoding a binding component that comprises a human immunoglobulin light chain variable V_(L/CH×ULC) domain, wherein the nucleic acid comprises a human rearranged V_(L)/J_(L) gene sequence operably linked with a non-human or human immunoglobulin heavy chain constant gene or a non-human or human immunoglobulin light chain constant gene.
 22. The nucleic acid of claim 21, wherein the non-human immunoglobulin heavy chain constant gene is selected from the group consisting of an intact rodent Iv gene, an intact rodent Igδ gene, an intact rodent Igγ gene, an intact rodent Igε gene, an intact rodent Igα gene, and a combination thereof.
 23. The nucleic acid of claim 21, wherein the human immunoglobulin heavy chain constant gene is selected from the group consisting of a human Iv gene, a human Igδ gene, a human Igγ gene, a human Igε gene, a human Igα gene, and a combination thereof.
 24. The nucleic acid of claim 23, wherein the human immunoglobulin heavy chain constant gene is a human Igγ gene.
 25. The nucleic acid of claim 24, wherein the human Igγ gene encodes an IgG subclass selected from the group consisting of IgG1, IgG2 and IgG4.
 26. The nucleic acid of claim 25, wherein the human Igγ gene comprises a mutation in the C_(H)3 encoding sequence that reduces binding affinity of the encoded C_(H)3 domain to Protein A compared to a wildtype C_(H)3 domain.
 27. A nucleic acid encoding a binding component that comprises a human immunoglobulin light chain variable V_(L/CH×ULC) domain fused to a light chain constant region, wherein the nucleic acid comprises a human rearranged V_(L)/J_(L) gene sequence operably linked with a non-human or human immunoglobulin light chain constant gene.
 28. A cell comprising the nucleic acid of claim
 21. 29. The cell of claim 28, wherein the cell is a lymphocyte.
 30. The cell of claim 29, wherein the cell is a host cell.
 31. A method of making a multi-specific antigen-binding protein comprising co-expressing in a host cell (i) a first nucleic acid comprising a nucleic sequence that encodes a human V_(L/CH×ULC) domain operably linked with a first non-human or human heavy chain constant region gene comprising a functional C_(H)1 domain encoding sequence, wherein the human V_(L/CH×ULC) domain binds a first epitope independently of a cognate variable domain, (ii) a second nucleic acid comprising a nucleic acid sequence that encodes either a human V_(H×ULC) domain or a second human V_(L/CH×ULC) domain that is fused with a second non-human or human heavy chain constant region gene comprising a functional C_(H)1 domain encoding sequence, wherein the V_(H×ULC) domain or second V_(L/CH×ULC) domain binds a second epitope independently of a cognate variable domain, wherein the first and second epitopes are not identical; and (iii) a third nucleic acid encoding a human universal light chain variable domain fused to a non-human or human light chain constant domain.
 32. The method of claim 31, wherein the first and second heavy chain constant region genes are human heavy chain constant region genes, and wherein the light chain constant domain is a human light chain constant domain.
 33. An antigen-binding protein made according to the method of claim
 31. 